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ABSTRACT 

Each of the four issues of this newsletter published 
in 1993 consists of one article dealing with a particular policy 
debate. Number 1, "Canadian Special Education Policies: Children with 
Learning Disabilities in a Bilingual and Multicultural Society" 

(Linda S. Siegel and Judith Wiener), discusses social and cultural 
factors affecting the education of children with learning 
di s ab i 1 i t i es , including policy issues, and bilingualism and 
mul t i cul t ural i sm. This issue contains 34 references. Number 2, "Using 
Research and Theory To Justify and Inform Head Start Expansion" 
(Edward Zigler and Sally J. Styfco) , examines Head Start's programs 
and goals, and addresses issues surrounding the evaluation of Head 
Start, policy directions (including issues of quantity and quality), 
and planning for a better, bigger Head Start program. This issue 
contains 87 references. Number 3, "Child Witnesses: Translating 
Research into Policy" (Stephen J. Ceci and Maggie Bruck) , begins with 
a discussion of the McMartin preschool (Manhattan Beach, California) 
child sexual abuse case, and moves on to a discussion of the 
prevalence of abuse and court involvement. Research on children's 
sugges t ibi 1 i ty , including past and present trends, are discussed as 
are other issues, including policy implications for expert witnesses, 
the relationship of research to clinical practice, and professional 
organizations as ethical gatekeepers. This issue contains 97 
references. Number 4, "Integrating Science and Ethics in Research 
with High-Risk Children and Youth (Celia B. Fisher), deals with 
federal and professional guidelines for research with children and 
youth, and such issues as informed consent, problems identified 
during the course of research, generalization of research results to 
individuals and diverse populations, and recommendations for future 
policy. This issue contains 102 references. (DR) 
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Introduction 

Canada is a vast and diverse country with a 
relatively small population of approximately 27 
million. Although on a superficial level it may 
seem similar to the United States in economic 
structure, language, and culture, we will argue in 
this report that there are some important differ- 
ences. We will discuss how societal and cultural 
factors affect the education of children, particu- 
larly those with learning disabilities. Of special 
interest will be the role of bilingualism and 
multiculturalism in Canada’s approach to the 
education of its children, and we will focus on 
areas that distinguish Canada from the U.S. 

Present-day Canada has been characterized 
as a “vertical mosaic” comprised of a unique 
mixture of several cultures which are loosely 
organized hierarchically along the lines of eco- 
nomics and political power (Porter, 1965, 1987). 
These cultures include the native peoples, or First 



Nations, who were the original inha bitants; Cana- 
dians of French descent who settled in Canada 
during the 1 7 th and 1 8th centuries ; and Canadians 
of British descent who came to Canada from 
Britain from the 1 7 th century onwards or from the 
United States during the American Revolution in 
the latter part of the 18th century. The late 19th 
and 20th centuries brought three additional waves 
of immigration. In the latter part of the 19th 
century and early 20th century, immigrants from 
Asia, mainly China, came to the west coast of 
Canada. Immigrants from various European coun- 
tries came to central Canada and the prairie prov- 
inces during the early 20th century and following 
World War II. More recently, Canada has experi- 
enced a new influx of immigrants and refugees, 
mostly from countries in economic or political 
turmoil : from Latin America, the Caribbean, Asia, 
Africa, the Indian subcontinent, the former Soviet 
Union, and the Middle East. All these cultural 
groups have had a major influence on Canadian 
society and its educational system. 
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Societal and Cultural Factors Affecting the 

Education of Children with Learning 
Disabilities 

In this report we will address each of five 
societal and cultural factors which we have iden- 
tified as important determinants of the education 
of children with learning disabilities in Canada. 
The first two relate to policy issues, the remaining 
three to issues of bilingualism and multi- 
culturalism. 

1 . Canadian attitudes toward public policy. 
Canada has a relatively well- developed social 
safety net in that government funding supports 
extensive education, health, and social services. 
This has led to the expectation that services for 
children with learning disabilities should, for the 
most part, be provided by the public sector. Cana- 
dian attitudes toward education and service deliv- 
ery have affected the process of program develop- 
ment for these children, and constitute an impor- 
tant factor differentiating Canada from the U.S. 

2. Provincial jurisdiction. Jurisdiction of 
education in Canada is exclusively provincial 
(within a federal political system). Consequently, 
educational legislation and policy vary signifi- 
cantly from province to province. 

3. Official bilingualism. Canada has two 
official languages — English and French — and this 
has important implications for students with learn- 
ing disabilities who must either study or learn in 
a second language. 

4. Multiculturalism. In addition to its En- 
glish and French speaking populations, Canada 
has a large multicultural community. The domi- 
nant Canadian ethos holds that these communities 
should maintain the culture of their country of 
origin while simultaneously integrating into Ca- 
nadian society. This approach to multiculturalism 
implies an acceptance and fostering of diversity 



that contrasts with U.S. policy on multiculturalism, 
which rests more explicitly on the individual's 
right to equal protection under the law as guaran- 
teed by the U.S. Constitution. Diagnosing learn- 
ing disabilities and differentiating them from prob- 
lems stemming from the second language acqui- 
sition of English or French presents a major 
challenge. 

5. Native peoples. Children from Native 
Canadian communities have specific language, 
learning, and cultural needs; 1.3% of Canada’s 
children are from native groups. 

Policy Issues 

Canadian attitudes toward public policy. To 
characterize Canada’ s disposition toward the spe- 
cial education of children with learning disabili- 
ties, we rely here largely on our own observations. 
AsoneprominentCanadian sociologist has pointed 
out, there is a “paucity of information upon which 
to draw to construct a plausible Canadian charac- 
ter” (Porter, 1987, p. 90). It is our contention that 
Canadian attitudes have affected the making of 
policy and the mode of service delivery for chil- 
dren with learning disabilities. 

Canadians tend to value and trust publicly 
funded institutions. They do not express the same 
kind of ambivalence about public and private 
responsibility for children’s welfare that charac- 
terizes social attitudes in the U.S. (Jacobs & 
Davies, 1991). Canadians readily acknowledge 
the need for public provision of education, health, 
and social services, and they also prefer collabo- 
ration over litigation in the decision-making pro- 
cess. Lawsuits overeducational matters, although 
they do occur, are rare in Canada. Moreover, a 
philosophy upholding the importance of the whole 
child pervades the Canadian approach to service 
provision. 
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How have these attitudes affected the devel- 
opment of services for children with learning 
disabilities? First, most service provision takes 
place within public institutions — through the 
school system primarily, but also in hospital and 
university affiliated learning centers and children’ s 
mental health clinics. Although some children 
may experience waits for services as long as 6 
months, and more effective advocates get served 
more quickly, personal financial resources are 
seldom an issue. School boards and other agen- 
cies frequently make referrals to each other and 
collaborate on programs for individual children. 
Rarely, however, does one public institution (e.g., 
a school board) pay another (e.g., a hospital clinic) 
for services outside its jurisdiction. And distrust 
sometimes does arise between the private and 
public sectors. 

Concurring with the literary critic and author 
Margaret Atwood, Porter writes that “survival is 
a dominant theme in the country’s political life” 
and that our “concern for survival as a political 
entity places a premium on the practices of the 
past, [and] favors compromise and things as they 
are” (1987, p. 90). It is our view that Porter’s 
interpretation, while not inaccurate, can be ex- 
pressed more positively. Does a preference for 
compromise over confrontation, collaboration over 
litigation, really mean that “we Canadians are a 
conservative people” (1987, p. 90)? 

One specialist writes that “service provision 
for the learning disabled essentially has been a 
nonadversarial process which has facilitated ho- 
listic, collaborative interventions involving edu- 
cators, parents, peers, and most importantly, the 
learning disabled person.” (Kronick, 1987, p. 1). 
It is our view that this preference for compromise 
and collaboration has benefited children with 
learning disabilities. School special services staff 
have been able to devote their time and energy to 
more informal types of assessment with the goal 



of developing instructional strategies rather than 
formal assessments aimed more narrowly at docu- 
menting need. Committee meetings to decide on 
placement and programming are usually infor- 
mal and friendly. Even in provinces where actual 
legislation remains weak, services and programs 
have often been provided nonetheless, in the 
absence of mandatory policy. In addition, a pre- 
vailing holistic perspective has meant that at least 
some efforts have gone to developing social skills 
interventions, parent education, and counseling 
services. 

This nonadversarial, publicly funded, and 
collaborative provision of services must be un- 
derstood further in the context of the Canadian 
health care system, which funds clinics serving 
many children with learning disabilities. The 
health care system, unlike that of the U.S., is itself 
supported entirely by public funds and provides 
universal access. Public appreciation is wide- 
spread; litigation and malpractice suits are far 
less common, for instance, than in the U.S. For an 
excellent discussion of the Canadian health care 
system see “A search for solutions,” Consumer 
Reports, 1992. 

Provincial jurisdiction. Canada is comprised 
of ten provinces, the Northwest Territories, and 
the Yukon Territory. In 1988 the number of 
children in these provinces and territories ranged 
from 4,922 in the Yukon to 1 ,796,244 in Ontario. 
The four maritime provinces and the territories 
had a total child population of less than 500,000, 
with the Northwest Territories and Prince Ed- 
ward Island having approximately 13,000 and 
24,000 children respectively, and Newfoundland, 
Nova Scotia, and New Brunswick hovering around 

150.000 each. Two of the three prairie provinces 
(Manitoba and Saskatchewan) had just over 

200.000 children each. The child populations in 
Alberta, British Columbia, and Quebec were 
426,476, 514,464, and 943,652, respectively. 



Unlike the states of the U.S., the provinces of 
Canada have more autonomy from the federal 
government in the area of education. Each prov- 
ince has its own education act and policies pertain- 
ing to special education in general and learning 
disabilities in particular. All but three provinces — 
Prince Edward Island (the smallest province), 
Alberta, and British Columbia — have mandatory 
special education legislation (Poirier, Goguen, & 
Leslie, 1988). The legislation in these three prov- 
inces is permissive, that is, special education ser- 
vices may be provided by but not required of 
school boards. 

The specific provisions of mandatory special 
education also vary from province to province 
(Poirier, Goguen, & Leslie, 1988). While manda- 
tory legislation in the seven provinces that adopted 
it requires school boards to admit children with 
special needs, only in Manitoba and Quebec is this 
right to education universal, i.e., no child may be 
excluded from school. In the remaining five prov- 
inces the right to an education is quasi-universal in 
that some hard-to- serve children may be excluded. 
For example, in Ontario if a school board (the 
Canadian term roughly equivalent to a school 
district in the U.S.) determines that it does not have 
the facilities to provide for a specific child, the 
board may identify that child as “hard to serve.” 
According to the 1980 amendments to the Educa- 
tion Act in Ontario, the school board’s only obliga- 
tion is to ‘ assist the parent or guardian to locate a 
placement . . . suited to the needs of the pupil and 
reimburse the parent or guardian for any expenses 
incurred.” 

In only five provinces (Manitoba, New 
Brunswick, Ontario, Quebec, and Saskatchewan) 
are special education teachers required to have 
specialized certification and training. In only three 
provinces (Ontario, Quebec, and Saskatchewan) is 
there any provision that education must be appro- 
priate to the needs of the child. Even in these three, 



formal approval of individual educational plans 
may not be required. Only in Quebec and 
Saskatchewan must children be placed in the least 
restrictive environment. The mandatory legisla- 
tion in most provinces only covers school-aged 
children (ages 5 or 6, to 18 or 21); thus, in most 
provinces systematic early identification programs 
do not begin before school entry in kindergarten 
or grade 1. The exception is Quebec where 2,247 
exceptional preschoolers are being served by the 
school system. In most other provinces children 
may be referred by parents, physicians, or others 
to hospital or community and social services 
clinics for evaluation and preschool program- 
ming. Finally, in only four provinces (Nova Scotia, 
Quebec, Ontario, and Saskatchewan) does the 
legislation state that parents have a right to be 
involved in the decision-making process. 

Children with learning disabilities. Prov- 
inces differ in their provision of services for 
children with learning disabilities. In a survey 
conducted in 1988 by the Canadian Council for 
Exceptional Children (CEC), five of the ten prov- 
inces and both territories reported that they either 
did not categorize children at all or that children 
with mild disabilities received noncategorical ser- 
vices. It should be noted, however, that the five 
provinces that provide specific services for stu- 
dents with learning disabilities encompass 80% of 
the child population of Canada. One of the factors 
affecting the provision of services to children with 
learning disabilities is geography. Many of the 
provinces and territories providing only 
noncategorical services have small populations 
with a relatively high proportion of the population 
living in isolated rural communities (communi- 
ties separated by more than 100 kilometers). Some 
experts have suggested that the Financial and 
social costs of transporting children with learning 
disabilities to communities large enough to pro- 
ide specialized services may outweigh the ben- 
efits (Bachor & Crealock, 1986). 
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The proportions of children formally identi- 
fied as having learning disabilities varied widely, 
according to the CEC survey, in the five prov- 
inces that provide categorical service, with Que- 
bec identifying 10.2% of its children. Nova Scotia 
7%, Ontario 3.1%, Saskatchewan 1 .7%, and Brit- 
ish Columbia 1.3%. Identification rates within 
provinces may also show considerable variation. 
For example, a survey of the mental health needs 
of children and youth with learning disabilities in 
Metropolitan Toronto (Canada’ s largest city with 
a population of approximately 3 million people), 
identified 4.2% of school children as having 
learning disabilities, but the rates of the seven 
separate school boards ranged from 0.9% to 27% 
(Cummings, Hebb-Grier, Brazil, & Vallance, 
1990). The definitions of learning disabilities are 
quite varied and not governed by formulae as are 
some in the U.S. 

Services for children with learning disabili- 
ties also vary widely within the same province or 
adjacent school boards, but this is not mainly 
related to funding. Unlike school districts in the 
U.S., Canadian school boards do not receive 
provincial or federal funds based on the number 
of children identified with learning disabilities 
(or other handicaps). School board funds come 
from a combination of local property taxes and 
direct provincial funding, with provinces having 
formulae that adjust the level of provincial contri- 
bution based on the property tax base of the local 
community. There is no direct federal funding of 
education. Thus, school boards within provinces 
are relatively homogeneous in the amount of 
funds available to them. School boards in com- 
munities with high levels of poverty and large 
numbers of children with special needs may still 
find it difficult, however, to provide adequate 
services with the funds available. 

The Constitution Act of 1982, which en- 
acted the Canadian Charter of Rights and Free- 



doms, has had important implications for the 
rights of children with learning disabilities and 
other exceptional children. Although this federal 
law does not apply directly to education, the 
provisions of the Charter override virtually all 
provincial legislation. The equality rights provi- 
sion of the Charter states: 

“Every individual is equal before and under 
the law and has the right to the equal protection 
and equal benefit of the law without discrimina- 
tion and, in particular, without discrimination 
based on . . . mental or physical disability” (Cana- 
dian Charter, 1982). 

Of the four basic rights accorded every 
individual, “the right to equal benefit of the law” 
(i.e., a distribution of resources commen- 
surate with need) is especially important 
(Henteleff,1990). The law suggests that even in 
the absence of mandatory special education leg- 
islation (as in Prince Edward Island and British 
Columbia) or where legislation has not specified 
appropriate or required programs, the Charter 
may provide this right. Recent cases brought 
before the Supreme Court, for example, have 
stipulated that parents have a right to a hearing of 
a case for special education for their child, even 
if provincial education legislation does not in- 
clude this provision. Since no Charter case in- 
volving children with learning disabilities has 
yet reached the Supreme Court, there is consid- 
erable debate about how the Charter will be 
interpreted (Poirier, Goguen, & Leslie, 1988; 
Henteleff, 1990). 

Bilingualism and Multiculturalism 

Official bilingualism. English and French 
are the two official languages of Canada, and the 
federal government and the province of New 
Brunswick are officially bilingual. In Quebec, 
the official language is French and the majority 
of the population speaks French; in the rest of 
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Canada, the official language is English and the 
majority of the population speaks English. In 
most of Canada, alternative services in their na- 
tive language are provided to minority 
Francophones (French-speaking people) or 
Anglophones (English-speaking people) where 
sufficient demand exists. Further, at some point 
in theireducation (the timing and amount varying 
within each province), children receive instruc- 
tion in the second official language; approxi- 
mately 50% of Anglophone children outside 
Quebec are receiving French language instruc- 
tion at any given point in time (Statistics Canada, 
1990), and all Quebec Francophone children re- 
ceive English instruction from grade 4 through 
high school. In 1988-89, 1.9 million students 
were enrolled in second-language French courses. 
Also, approximately 5% of Anglophone children 
(228,000 children in 1988-89) are enrolled, at 
their parents’ request, in bilingual or French 
immersion programs wherein at least half the 
school day (typically all but one hour of the 
school day) is spent studying in French. 

What implications does official bilingual- 
ism have for children with learning disabilities? 
Lambert (1975) has used the terms “additive” and 
“subtractive” to describe the consequences of 
bilingualism. Positive or negative consequences 
are seen to be related to the context in which 
children are learning two languages, not to bilin- 
gualism per se. In some contexts, the learning of 
a second language can have either a positive 
impact or no impact on the development and 
maintenance of the child’s mother tongue, and no 
negative consequences on cognitive or affective 
development. Lambert has called this additive 
bilingualism. In other contexts, the learning of a 
second language can have a negative impact on 
first language acquisition and possibly other nega- 
tive consequences on cognitive and affective 
development. This is subtractive bilingualism. 



Assessing the additive or subtractive effects of 
bilingualism on children with learning disabili- 
ties may be particularly problematic. 

In this analysis we examine three different 
contexts of language acquisition. The first in- 
volves the provision of second language instruc- 
tion as a subject within the larger curriculum. 
Typica'ly this “core” French or English program 
relies on decontextualized and academic meth- 
ods of instruction whereby children study the 
language itself but do not use the language to 
study other subjects. Typically, the second lan- 
guage is not in common use in either the home or 
community. Children with learning disabilities 
may find core French or English instruction prob- 
lematic because learning a second language in 
this way may be especially difficult for them 
(Bruck, 1982; Wiss, 1989). While core French or 
English instruction may not be subtractive for 
children with learning disabilities, in the sense 
that it detracts from the acquisition of their first 
language, it is possible that frequent failure expe- 
riences with the new language may have negative 
affective consequences; empirical evidence is 
lacking on this point, however. Such consequences 
(to the extent that they exist) may follow, how- 
ever, from how the second language is taught. 
Second language instruction typically requires 
that students memorize verb forms, noun gen- 
ders, and spelling, rather than that they converse 
and learn practical vocabulary and phrases. 

A second context for language acquisition is 
that in which minority Anglophone or 
Francophone children receive instruction in their 
first language where this is not the dominant 
language of the province. This is the typical 
context for Anglophones in Quebec and 
Francophones outside Quebec. In some situa- 
tions, such as that of Francophones in New 
Brunswick and Anglophones in Quebec, their 
first language is usually the dominant language of 
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their family and local community and, in the case 
of Quebec Anglophones, the dominant language 
of North American culture and the media. These 
communities attempt to provide special education 
resources in the first language for children with 
learning disabilities. 

Identifying children with learning disabili- 
ties within this context of bilingualism can prove 
difficult, however. Standardized tests normed for 
Quebec Francophones may not be appropriate for 
minority Francophones outside Quebec. Further- 
more, even though legislation in provinces such 
as Ontario mandate it, recruiting qualified teach- 
ers to provide special education services in French 
to minority Francophone children with learning 
disabilities can be problematic. And even when 
teachers are themselves bilingual and trained in 
special education, they lack the methods with 
proven effectiveness to help children in these 
situations. 

Francophones outside Quebec and New 
Brunswick usually represent such small minori- 
ties in their communities that they have to know 
English for daily living. In some cases French 
may cease to be the dominant language of the 
home, and a combination of English and French 
may become the child’s first language. In some 
places where their numbers warrant it, such chil- 
dren may receive French education intended to 
maintain their language and culture. But accord- 
ing to Carey, “they are bilingual with incomplete 
learning of either language, and this provides for 
increased inconsistencies in sound to syllable 
mapping” (1987, p. 106). Similar to children with 
reading disabilities, they are slower in word nam- 
ing and phonetic decoding, and have longer laten- 
cies for abstract as opposed to concrete words. 
Their educational attainment is lower than 
Anglophones. In short, the experience for many of 
these Francophone children is subtractive bilin- 
gualism (Carey, 1987). 



A third context for the development of bilin- 
gualism includes French immersion and bilin- 
gual programs for Anglophone children. In French 
immersion programs, Anglophone children re- 
ceive almost all of their education in French with 
only approximately one hour per school day 
devoted to English language arts. In bilingual 
programs, instruction is in English for half the 
day, in French the other half. Anglophone chil- 
dren typically enter immersion with well-devel- 
oped skills in their first language but little expo- 
sure to French other than the short clips they see 
on Canadian Sesame Street (Carey, 1987). While 
French immersion programs may begin at any 
grade level, they begin most often in kindergarten 
or grade 1. Considerable evidence, including 
some from longitudinal studies, suggests that 
French immersion promotes additive bilingual- 
ism for the majority of Anglophone students 
(Carey, 1987; Lambert & Tucker, 1972). The 
extent to which this holds for children with learn- 
ing disabilities has been the subject of consider- 
able debate (Wiss, 1989). Some investigators 
assert that when children with learning disabili- 
ties are experiencing difficulty in both their French 
and English programs, it may be beneficial to 
maintain them in French immersion if they can 
obtain appropriate remedial assistance within the 
immersion program (Bruck, 1978; 1982; 
Cummins, 1979). Others suggest that a subgroup 
of children with learning disabilities who experi- 
ence difficulty in early French immersion only, 
not in English, may benefit from being exempted 
from the immersion program (Trites, 1976, 1981). 

A number of practical questions have arisen 
from the enrollment of children with learning 
disabilities in early French immersion programs. 
First, of course, is that at the time they enroll they 
have not yet received systematic reading instruc- 
tion, and their learning disabilities are yet to be 
detected. Both researchers and practitioners ques- 
tion whether a reliable method can be devised to 
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identify children with problems prior to school 
entry, before they encounter early immersion 
(Trites, 1986; Wiss, 1987). 

How to diagnose learning disabilities once 
children have entered French immersion consti- 
tutes a second set of questions. Should they be 
assessed in English, in French, or in both? If 
achievement tests based on norms of children 
who are native speakers of English or French are 
deemed inappropriate — and the consensus is that 
they are (Carey, 1987; Wiss, 1987) — what tests 
or assessment methods should be used? If achieve- 
ment is low in either or both languages, does this 
mean the child is necessarily learning disabled, or 
might he or she be simply responding poorly to 
immersion programming? At this point there are 
no answers to these questions. Children who 
experience difficulties are usually withdrawn from 
the French immersion program. 

A third set of questions asks what the most 
appropriate placement and program should be 
once children with learning disabilities begin 
their schooling in French immersion programs. 
While children with learning disabilities will 
continue to experience learning difficulties when 
switched to an English program, are the problems 
as great as they would have been had they re- 
mained in French immersion (Brack, 1978, 1982; 
Cummins, 1979)? Do different subgroups re- 
spond better to remaining in French immersion, 
or switching to English programming? Should 
special education be provided in French, English, 
or both? If special education in French is unavail- 
able to children with learning disabilities in French 
immersion programs, as is frequently the case, 
should they remain in French immersion? 



Official bilingualism has been a major po- 
litical and social concern in Canada since confed- 
eration in 1867. This issue takes on special sig- 
nificance for children with learning disabilities, 
because the requirement that they learn a second 
language or learn in a second language may lead 
io additional frustration and failure. Whereas 
experiences with bilingual schooling may be typi- 
cally additive for children without learning dis- 
abilities, the same experiences may subtract from 
the first language functioning of children with 
learning disabilities. These possible negative cog- 
nitive and affective consequences remain a sub- 
ject of debate. Educational systems tend to dis- 
courage research on these issues, sometimes ap- 
pearing reluctant to allow investigators permis- 
sion to address these questions. Consequently, 
research that would contribute to this debate has 
been limited. 

Multiculturalism. In addition to 
Anglophones and Francophones, Canada has a 
substantial population whose first language is 
neither English nor French. 1 The first language in 
this case is referred to as a “heritage language,” 
and in Quebec the speakers are referred to as 
Allophones. Allophones in Canada (including 
native peoples), in 1986 (the last census), num- 
bered 2,860,565, or 1 1 .3% of the total population. 
Considering the recent wave of immigration, 
these numbers have likely increased since then. 
Allophone children are concentrated in the three 
largest metropolitan centers in Canada — Toronto, 
Montreal, and Vancouver — with more than 50% 
of children in Toronto and Vancouver coming 
from homes where neither English nor French is 
the native language. Except for a fairly large 



'The proportion of non-native English speakers in Canada is much larger than in the U.S. In the U.S. 7.5% of residents 
over age 5 are native speakers of Spanish, and an additional 6.5% are native speakers of a language other than Spanish or 
English (Barringer, 1993). In Canada, in contrast, 40% of the population are native speakers of French, with an addi- 
tional 1 1.3% native speakers of a language other than French or English (Statistics Canada, 1990). 
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Chinese community, most immigrants to Canada 
prior to 1970 came from Europe. More recent 
waves of immigrants and refugees have come 
from countries in the Caribbean and Latin 
America, from Africa, the middle East, the Indian 
subcontinent, and in particular, from Somalia, 
Vietnam, Sri Lanka, and Hong Kong. 

Allophone children may receive instruction 
during the regular school program in their heri- 
tage language. Typically, heritage language pro- 
grams involve 3 to 7 hours per week and take 
place in the school if there are sufficient numbers 
of children (usually at least 25) who speak that 
language. There are currently 82 heritage lan- 
guages taught in the Toronto school system. In 
addition to programs in the schools, the govern- 
ment and the language communities themselves 
fund programs that provide instruction in the 
language and culture after school and/or on Sat- 
urdays. 

Educational problems related to 
multiculturalism resemble to a certain extent those 
of official bilingualism, that is, how to facilitate 
additive bilingualism or trilingualism. There is 
universal agreement that Allophone children 
should learn at least one of the dominant lan- 
guages (i.e., English or French). Educators also 
generally agree that strong functioning in the first 
language predicts good acquisition of a second 
and third language (Cummins, 1987). Where 
specialists do not agree is over the question of 
whether Allophone children who are weak in 
their first language should receive help to 
strengthen that language prior to or simultaneous 
with being taught a second or third language. 

Allophone children may be weak in their 
first language for a variety of reasons (Cummins, 
1984). Refugees may be victims of trauma and 
haphazard or nonexistent schooling. Others may 
have parents who neglect the child’s first lan- 
guage because they believe that even though their 



own second language skills are limited they are 
helping their children by speaking to them in 
English or French. As a result the children receive 
poor language modeling in both languages. Fi- 
nally, some Allophone children do indeed have 
learning disabilities. 

Canadian researchers and educators face a 
challenging problem in diagnosing learning dis- 
abilities among Allophone children. Although 
most children readily acquire the basic interper- 
sonal communication skills in a second language, 
linguistic proficiency in academic content can 
take five years or more (Carey, 1987; Cummins, 
1984). Consequently it is often difficult to dis- 
criminate between those Allophone children with 
learning disabilities who may require special 
education and those children who simply need 
more instruction in English or French as a second 
language. One can conceptualize learning dis- 
abilities as a continuum where children at one end 
are clearly not learning disabled and those at the 
other end would have learning difficulties no 
matter what the educational environment. It has 
been suggested that along this continuum lies a 
group of children who are vulnerable, but who, 
under optimal conditions, would not develop 
learning disabilities (Adelman, 1989). Some 
Allophone students who are being asked to learn 
in a new language within a foreign culture may fit 
into this category. 

Some researchers and educators have criti- 
cized Canadian school boards for excessive iden- 
tification of Allophone children as learning dis- 
abled (Cummins, 1987). Consequently, progres- 
sive school boards currently use dynamic assess- 
ment techniques that involve teaching the child in 
the course of the assessment as a way of learning 
more about the child's skills, and they tend to be 
cautious about prematurely identifying Allophone 
children as learning disabled. This approach has 
the disadvantage that some Allophone children 



ERIC 




9 



with learning disabilities who would benefit from 
special education services will not be identified. 
Such errors may be inevitable, however, until 
more reliable assessment methods are developed. 
Currently, investigators at McGill University in 
Montreal and the Ontario Institute for Studies in 
Education in Toronto are attempting to develop 
such tools. 

A number of studies have found that partici- 
pation in heritage language programs has no det- 
rimental effect on second language acquisition, 
whether English or French, and that children in 
these programs also achieve reasonable profi- 
ciency in the heritage language (Cummins et al., 
1984; da Fontoura & Siegel, in press; Swain, 
Lapkin, Rowen, & Hart, 1990; Siegel & So, 
1993). Most of the heritage language groups in 
these studies performed as well in tests of lan- 
guage and reading proficiency as their monolin- 
gual, English-speaking peers. In addition, bilin- 
gual children who demonstrate reading problems 
in English have been shown to have similar prob- 
lems in their heritage language, whether it is 
Portuguese (da Fontoura & Siegel, in press), J apa- 
nese (Saito & Siegel, 1993), Arabic (Abu Rabia & 
Siegel, 1993), or Cantonese (Siegel & So, 1993). 

Native peoples. As already indicated, 
Canada’s aboriginal or native peoples comprise 
approximately 1.3% of the population. Of this 
segment about 65% live in remote or rural com- 
munities, including the Inuit peoples, many of 
whom live in isolated regions in the North (Csapo, 
1989). Although language plays a role in any 
discussion of learning disabilities within native 
groups (Native Canadians represent 10 language 
groups and 58 dialects), poverty and cultural 
differences present more pressing issues. 

Native Canadians have suffered enormous 
exploitation, first by the European settlers, and 
subsequently by the dominant, majority govern- 
ments; and individual groups have suffered as- 



saults on their livelihood and culture. Poverty 
rates are high among native groups. Neonatal 
mortality is approximately 60% higher than the 
national rate, with the average death rate for 
native children three times that of the national 
rate (Csapo, 1989). The major causes of death 
among Native Indians are accidents, violence, 
and poisonings, with suicides being three times 
the national rate. It is estimated that 50% to 60% 
of Indian illnesses are alcohol-related, with fetal 
alcohol syndrome a relatively common problem 
among children. Due to the prevalence of persis- 
tent upper respiratory infections, children also 
frequently suffer hearing loss. 

The education of Native Indians is the re- 
sponsibility of the federal Department of Indian 
and Northern Affairs. Until the late 1960s many 
Natives received their education in residential 
schools whose aim was to eliminate their lan- 
guage and culture. Since 1970, however, the 
Department has run some schools on reserves and 
provides, in other cases, financing for schools to 
be run by the native bands themselves. Support is 
also provided for Natives to attend public schools 
under provincial jurisdiction. Nevertheless, re- 
searchers and advocates often express concerns 
about discrimination and the tracking of Natives 
into low academic streams. 

The average educational attainment of na- 
tive peoples is less than that of the non-native 
majority, with only 60% between the ages of 14 
and 18 attending school, compared to a national 
average of 75%. University attendance is less 
than half the national average of 25%. In the 
larger context of physical health, mental health, 
and learning problems, the task of differentiating 
between low academic achievement that is due to 
poverty and cultural factors and that due to learn- 
ing disabilities has not been a priority. It is clear, 
however, that native children are at risk, and that 
prevention programs are much needed. 
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Conclusion 

Our discussion paints overall a promising 
picture of learning disabilities practice and re- 
search in Canada. But the analysis also reveals 
some problems. First, of concern is the fact that 
three provinces — British Columbia, Alberta, and 
Prince Edward Island — have not yet passed spe- 
cial education legislation mandating services. 
Although precedents under the federal Charter of 
Human Rights and Freedoms (Canadian Charter, 
1982) suggest that exceptional children in these 
provinces do have a right to an education and 
parents to due process, the Charter does not set, 
and the courts have not required, specific stan- 
dards for that education. Even the legislation in 
most provinces where quasi-universal or univer- 
sal access has been established is somewhat defi- 
cient in the setting of such standards. 

The lack of special education services for 
native children with learning disabilities consti- 
tutes a second problem. This lack of services, it 
must be noted, is not confined just to learning 
disabilities; native children also have serious 
economic, physical, and mental health needs re- 
quiring attention. 

Although school-aged children with learn- 
ing disabilities are relatively well provided for in 
Canada, serious gaps in services for preschool 
children persist. While there are many excellent 
diagnostic and treatment centers serving pre- 
school children with learning and other develop- 
mental disabilities, few provinces have system- 
atic screening programs for identifying them prior 
to school entry. Much more research needs to be 
conducted on appropriate assessments and 
remediation strategies for children with possible 
learning disabilities in a bilingual and multicultural 
setting. We do not yet know how best to identify 
these children or exactly how to treat their prob- 
lems once identified. 



Finally, Canada can nevertheless serve in 
some aspects as a model. As traditions have it, 
services for children with learning disabilities are 
relatively unfragmented and tend to serve the 
whole child. Teaching and teachers tend to re- 
ceive respect Collaborative consultation and col- 
laborative advocacy are the rule, not the excep- 
tion. Unlike in many states in the U.S., there is not 
usually the insistence on a strict definition of a 
learning disability in order for a child to be served. 
Consequently, in spite of legislative inadequacies 
and problems related to identification, most chil- 
dren with learning disabilities are able to receive 
services. 
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Head Start is America’s favorite social pro- 
gram — oris it? President Clinton appeared to echo 
public sentiment in his first congressional address 
when he described the program as a sound social 
investment deserving of additional support. But 
what seemed like a unanimous call to fully fund 
Head Start was soon punctuated by voices of 
dissent. The attack should have been expected — 
the inevitable backlash that occurs whenever a 
favored position becomes so politically correct as 
to invite scrutiny. Also, conservatives were newly 
freed from their duty to side with the former 
Republican president, who vocally supported Head 
Start, and found the changing of the political guard 
an opportune time to counter whatever the Demo- 
cratic victors held dear. They found ammunition 
in the constructive criticisms being offered by 
Head Start's friends, many of whom had silenced 
their concerns during prior administrations intent 
on reducing social welfare expenditures. Soon the 
print media began to fill with articles portraying 
Head Start as a national boondoggle, a “scam” that 
is a waste of precious tax dollars (Hood, 1992, 



1 993). The program's advocates felt their sense of 
victory turn into a defensive stance. 

They have assumed this posture before. 
Since its inception, opinions of Head Start have 
bounced from positive to negative and back again. 
The project has been favored as a quick means of 
boosting poor children's IQ scores and criticized 
when these increases did not appear to be perma- 
nent. It has been praised as a cure for welfare 
dependency, teenage pregnancy, and criminality 
and condemned because no one ever collected 
data to substantiate such claims. Today Head Start 
is in the middle of an ideological debate over the 
proper role of government in solving social prob- 
lems. Can the program help the nation attain its 
goal of having all children ready to leam when 
they begin school, or is it just another example of 
"Big Brother" intruding in the lives of private 
citizens? 

Head Start has clearly been subjected to 
unrealistic expectations and entangled in the web 
of politics. The time is long overdue to look at this 
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program objectively, sitting fact from opinion and 
separating the emotions from the evidence that 
should be the primary basis for determining its 
future course. Head Start has been in operation for 
nearly 30 years and has been evaluated more 
extensively than perhaps any other social pro- 
gram. During this time the disciplines devoted to 
early childhood have blossomed and generated 
sophisticated theories and practical applications. 
The purpose of this paper is to use this accumu- 
lated wisdom to make a realistic assessment of 
Head Start. Such an analysis is vital not only to 
determining whether the program's record justi- 
fies its expansion. The knowledge derived from 
experience with Head Start can guide the program' s 
evolution and inform the search foreffective strat- 
egies of early intervention and education for chil- 
dren in poverty. 

Head Start’s Program and Goals 

The current controversy over the value of 
Head Start is riddled with misconceptions about 
the program's structure and mission. Head Start is 
referred to either as child care, preschool educa- 
tion. health and social services, or a training and 
job program for poor adults. The project, in fact, 
serves all of these functions. Many observers also 
have confused Head Start with the Perry Pre- 
school. an educational intervention project con- 
ducted in the 1960s and still being evaluated for 
lifetime effects. In reality. Head Start is a very 
different type of program than the Perry Project 
was, so the two should not so casually be com- 
pared. To enable constructive debate and effective 
social policy, interested parties must have an accu- 
rate view of Head Start's nature. This section 
describes Head Start's model, goals, and imple- 
mentation to help clarify what this frequently 
misunderstood program actually encompasses. 



A Brief His tory 

Project Head Start originated during an 
optimistic period of American history, a time 
when there were both the desire and financial 
means to attempt to eliminate poverty and pro- 
mote social equality. The federal government took 
a proactive role in this effort, launching the War on 
Poverty to provide education and self-help oppor- 
tunities to enable socioeconomically disadvan- 
taged citizens to improve their lives. The 1964 
Economic Opportunity Act (EOA) opened the 
War by creating several projects for poor adults, 
including the Community Action Programs. The 
CAP were designed to assist local communities in 
establishing and administering their own antipov- 
erty programs. However, some local governments 
opposed the CAP's proposed placement of admin- 
istrative control and resources in the hands of 
impoverished citizens and therefore refused to 
apply for program grants. In an effort to make the 
CAP more palatable to local officials, while using 
what would have been an embarrassing budget 
surplus, the Head Start project was born. Sargent 
Shriver. head of the Office of Economic Oppor- 
tunity and chief strategist of the Johnson Admin- 
istration's War on Poverty, was struck by the 
realization that children comprised half of the 
population living below poverty. He envisioned a 
preschool intervention program as a way to “over- 
come a lot of hostility in our society against the 
poor,” because no one could blame children for 
their financial predicament or fear giving them too 
much power (quoted in Zigler & Anderson, 1 979, 
p. 12). 

With the exception of a few compensatory 
education projects, there was very little experi- 
ence at the time to suggest how to meet the needs 
of economically disadvantaged preschoolers. Is- 
sues of program content, size, and duration were 
therefore open questions for Shriver as he began 
piecing together a national intervention project. 
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His first step was to appoint a planning committee 
of 14 experts from the fields of preschool educa- 
tion, health, child development, and mental health. 
This diversity ensured that Head Start would be- 
come far more than an educational program. (For 
the history of the development of Head Start, see 
Zigler & Nluenchow, 1992, and Zigler & Valen- 
tine, 1979.) 

Dr. Robert Cooke, a noted pediatrician, 
chaired the group, which had only a few months to 
develop their plans. The committee’s recommen- 
dations, presented to Shriver in February 1965, 
were based on a “whole child” philosophy that 
called for comprehensive programming: Head 
Start’s goals were to improve physical health, 
enhance mental processes (particularly concep- 
tual and verbal skills), and foster social and emo- 
tional development, self-confidence, relationships 
with family and others, social responsibility, and 
a sense of dignity and self-worth for both the 
child and family (“Recommendations for a Head 
Start Program.” 1965). Note that only one of these 
seven goals specifically related to intellectual 
performance. 

The committee’s recommendations were 
unique not only because they outlined a multifac- 
eted intervention but because they included the 
family as well as the child. The planning docu- 
ment accorded parents a central role that was 
virtually unprecedented in the design of preschool 
or compensatory education programs. Prior to 
Head Start, economically disadvantaged families 
had been treated as passive recipients of services 
dispensed by professionals. But Head Start par- 
ents were to be involved in the planning, adminis- 
tration. and daily activities of their local centers. 
One reason for this decision derived from Head 
Start's origins in the CAP, which were mandated 
to allow poor citizens “maximum feasible partici- 
pation” in running antipoverty efforts. The EOA 
contained no clear definition of this slogan, and 



the Head Start planning committee interpreted it 
more literally than many local officials might have 
liked (see Valentine & Stark, 1979). The planners 
were also influenced by the ideas of one member, 
Urie Bronfenbrenner, who was just beginning to 
develop his ecological approach to human devel- 
opment. Bronfenbrenner (1974, 1979) argued that 
there is a complex interrelationship among chil- 
dren. their families, and communities, so interven- 
tion must touch all of these areas to be effective. 
This insight was an astute one: today, parent 
participation is recognized as crucial to the suc- 
cess of early intervention as well as later education 
(e.g.. Comer, 1980; Powell, 1982; Seitz, 1990). 

Program Implementation 

Expert advisors and planning committee 
members counseled Shriver that a small, pilot 
program should be run and evaluated prior to 
mounting a large-scale effort. Yet the Johnson 
Administration demanded that Shriver fire a ma- 
jor volley in the War on Poverty by beginning 
on a large scale, a plan made possible by the 
economic prosperity of the times. When Head 
Start opened in 1965, over 500,000 children were 
served in a summer program lasting 6 or 8 weeks. 
Unfortunately, there was not enough time to screen 
all the grant applications carefully, so variation in 
quality characterized Head Start from the very 
beginning. 

Today the basic Head Start program is a 
center-based preschool serving primarily poor 
children aged 3 to 5. with the majority (63%) 
being 4 years old (Administration on Children, 
Youth and Families, 1993). Most children attend 
a half-day session for a school year, although 
some participate for 2 years. Full-day programs 
and home-based services are delivered in some 
locations. Federal guidelines require that at least 
90% of the children enrolled be from families 
whose income falls below the poverty line; at least 
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10% of enrollment must consist of children with 
disabilities. Head Start programs by law receive 
80% of their funding from the federal government 
and the rest from other, usually local, sources, 
which may be in the form of in-kind services. 

In fiscal year 1 992, there were 1 ,370 Head 
Start grantees serving over 62 1 ,000 children and 
their families (Administration on Children, Youth 
and Families, 1993). (This is less than the yearly 
enrollment in the late 1960s but represents a 35% 
increase since 1989.) Each program is required to 
include six components: early childhood educa- 
tion, health screening and referral, mental health 
services, nutrition education and hot meals, social 
services for the child and family, and parent in- 
vc lvement. Local programs are overseen by Policy 
Advisory Councils, composed of parents, staff, 
and community representatives who are respon- 
sible for operating and staffing decisions. Although 
all Head Start programs must adhere to a body of 
national regulatory standards, each is encouraged 
to adapt the program to local needs and resources. 
Thus it is somewhat misleading to speak of Head 
Start as a singular intervention because of the varia- 
tion in localized programming. 

Evaluating Head Start 

Head Start was designed as a comprehen- 
sive, two-generation, community-based program. 
Its complexity and uniqueness made evaluation 
difficult from the start. Federal officials under- 
standably wanted a national impact study to ascer- 
tain the effects of this costly new experiment. Yet 
the diversity of Head Start programs around the 
country would confound comparisons of preschool 
and control groups, because the preschool samples 
would not all have had the same Head Start expe- 
rience. The participants themselves were a hetero- 
geneous population differing in terms of racial, 
ethnic, linguistic, geographic, and other charac- 



teristics. Further, random assignment to pre- 
school and comparison groups would be 
problematic because eligibility for Head Start 
was defined by law. This left evaluation subject to 
less rigorous quasi-experimental designs (Cook & 
Campbell, 1979). Another problem was that 
there were no generally accepted definitions or 
measures of some concepts like social compe- 
tence and parental empowerment that were part 
of Head Start's agenda. 

The burst of research in the program's first 
decade illuminated these and other problems in 
evaluating such a complex intervention. Although 
this paved the way for more informed investiga- 
tions, few were subsequently conducted. Support 
for Head Start research was greatly reduced dur- 
ing the 1980s, falling from 2.5% of the project's 
budget in 1974 to. 1 l%in 1989, including demon- 
stration efforts (National Head Start Association, 

1 990). Much of what is known about the effects of 
Head Start is therefore based on old, sometimes 
problematic studies, and, as we will discuss later 
in this section, there is much that is still not known. 

Focus on Intelligence 

Despite the many goals of Head Start, 
initial research focused almost exclusively on 
how much the program could raise children's 
intelligence test scores. This emphasis sprang 
from a popular view within developmental psy- 
chology that even limited environmental interven- 
tions could stimulate cognitive growth (see Schorr, 
1988; Spitz, 1986; Zigler, 1970). Supporters of 
this position held that if the “right” experiences 
were provided at the “right” time (commonly 
assumed to be before age 5), IQ scores could be 
raised dramatically. Another reason that Head 
Start came to be judged on the basis of IQ changes 
could be that psychologists conducted most of the 
evaluations, and standard IQ tests are tools of their 
trade. If physicians had designed the first studies. 
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perhaps Head Start's health benefits would have 
been evaluated instead. The positive impact of the 
health component (see below ) might have become 
better known as a result. 

Also contributing to the fixation on intel- 
lectual effects is the fact that the results were 
striking. Early studies showed that children's IQ 
scores leapt by at least 1 0 points and their achieve- 
ment levels rose after just a few weeks of Head 
Start (see Datta. 1979). Then researchers at Ohio 
University and the Westinghouse Learning Cor- 
poration delivered the unwelcome news that the 
achievement gains faded away shortly after chil- 
dren entered school (Cicirelli, 1969). Experts in 
statistics and evaluation conducted in-depth re- 
views of the Westinghouse report and criticized it 
for problems with sampling procedures, data 
analysis, and appropriateness of the outcome 
measures (e.g., Campbell & Erlebacher. 1970; 
Datta, 1976; Lazar, 1981; Smith & Bissell, 1970). 
Nonetheless, subsequent studies of Head Start and 
almost every other early intervention program 
reached the same conclusion: preschool graduates 
generally do not continue to do better on cognitive 
tests or school quizzes. These reports dashed hopes 
that brief preschool experiences could guarantee 
academic success for poor children. Disappointed 
federal officials began to entertain plans to phase 
out Project Head Start. 

Parents — empowered by their involvement 
in Head Start — came out in force and ultimately 
saved the program. Their efforts were soon bol- 
stered by the release of findings from the Consor- 
tium for Longitudinal Studies (1978, 1983), a 
group of investigators who had evaluated 1 1 early 
intervention programs (2 of them Head Start) 
during the 1960s and early 1970s. The researchers 
located original program participants and col- 
lected a uniform set of information about them. 
Data from the programs were combined and ana- 
lyzed using sophisticated techniques which avoided 



the problems that had weakened the validity of the 
Westinghouse findings. All significant results 
were reanalyzed by dropping the program with 
the strongest effect to determine whether it still 
held among the others (in effect, guaranteeing 
conservative conclusions). 

The Consortium results confirmed that 
children who attend quality preschool programs 
evidence an initial boost in IQ and achievement 
scores that lasts for some years but eventually 
fades. However, lasting effects were found in 
other areas: participants were less likely to be 
assigned to special education classes, and were 
somewhat less likely to be held back a grade in 
school . The rigor of the Consortium methodology, 
and the findings of benefits that persisted until 
many children had reached 1 2 or more years of 
age, helped to restore faith in the value of early 
intervention. 

Other longitudinal studies and reviews of 
research specific to Head Start programs have 
yielded similarly encouraging results. The Head 
Start Synthesis Project, a meta-analysis of some of 
over 200 studies reviewed, also found the loss of 
initial c itive gains but reported that Head Start 
children had better health, immunization rates, 
and nutrition, as well as enhanced socioemotional 
characteristics (McKey et al., 1985). Family life 
was also found to be strengthened through the 
in *olvement of parents in the program. A study of 
thousands of children who had attended Head 
Start in 33 programs throughout Philadelphia 
showed that they had better school adjustment 
than peers who had no preschool experience 
(Copple, Cline, & Smith, 1987). After at least 
sixth grade, the preschool graduates had fewer 
school absences, did not miss as many standard- 
ized tests, and seemed less likely to be retained in 
grade. Another study of over 1 ,900 children who 
had attended Head Start at three different times 
revealed that by the end of high school, the oldest 
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group performed better academically than a con- 
trol sample (Hebbeler. 1985). 

Although much of this research has tech- 
nical problems that make the findings tentative, 
the results are all in the same direction: Head Start 
does not permanently raise IQ scores but may have 
enduring effects on school adjustment and other 
aspects of social competence (McCall. 1993). 
Findings concerning immediate program effects 
are much more definite: when children leave Head 
Start they have better IQ test scores and school 
readiness skills. In other words. Head Start does 
help prepare children for school. In lamenting the 
loss of initial cognitive benefits, many critics have 
overlooked Head Start's success in this regard. 

There are several reasons why children 
leave the program testing well and then may lose 
this advantage. One is that performance on an IQ 
test reflects not only formal cognitive processes 
but achievement and personality variables as well. 
The IQ gains apparent after Head Start are not 
necessarily due to expanded intellectual capacity 
hut may instead be explained by improved moti- 
vation. familiarity with test content, and comfort 
in the testing situation (Seitz. Abelson, Levine. & 
Zigler. 1975; Zigler. Abelson. Trickett, & Seitz. 
1982; Zigler & Butterfield. 1968). Thus, partici- 
pation in Head Start can enable children to de- 
velop the skills and attitudes needed to apply their 
abilities more fully. When they enter school, how- 
ever. the environment may not continue to encour- 
age full use of their potential; for example, there 
may be a poor curriculum or teaching practices 
that lower self-confidence and ignore individual 
learning styles. Another explanation for fade-out. 
advanced by early childhood educators, is that the 
paper-and-pencil. standardized group achievement 
tests used in many studies are inappropriate for 
children in the primary grades (e.g.. Meisels. 1992). 
This argument supports Barnett's ( 1992) position 
that the achievement gains made in preschool 



probably do not fade out at all. He suggests that 
their apparent loss is an artifact of measurement, 
statistical analysis, and sampling procedures. For 
example, poor achievers in both preschool gradu- 
ate and comparison groups are likely to be retained 
in grade and thus dropped from the evaluation, 
thereby minimizing the differences among the 
remaining students. 

Confusion with Other Interventions 

Many of the current criticisms of Head 
Start center on the argument that the project's 
results do not match those of more expensive 
preschool programs. This claim would not have 
assumed much importance if the findings from 
one such program — the Perry Preschool — did not 
become widely attributed to Head Start. Yet the 
fact that they did encourages us to examine not 
only the weaknesses in Head Start but the 
strengths, with the result being a more complete 
picture of the potential of early intervention. 

The Perry Piogram was conducted by re- 
searchers at the High/Scope Educational Research 
Foundation in Ypsilanti. Michigan (Berrueta- 
Clement. Schweinhart. Barnett. Epstein. & 
Weikart. 1984). Like many other experimental 
projects, the program operated for only a few 
years and involved a relatively small number of 
children. What piqued scientists' interests was 
that the participants were followed into adult- 
hood. with results reported for age 19 and re- 
cently forage 27 1 Schweinhart. Barnes. & Weikart. 
1993). What piqued public interest was a cost- 
benefit analysis which placed a monetary value on 
program effects. 

The High/Scope investigators estimated 
the savings to society resulting from better social 
competence among Perry graduates, including 
reduced grade retention and usage of the welfare 
and criminal justice systems. They also counted 
higher contributions to the tax base resulting from 
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higher employment rates. They reported that for 
every dollar spent on the preschool program, tax- 
payers received a savings of $3 to $6 by the time 
participants had reached 19 years of age (Barnett, 

1 985). By age 27, these savings were estimated to 
be $7 (Barnett. 1993). 

The High/Scope researchers cautioned 
against generalizing their findings to the national 
Head Start program, a warning issued earlier by 
the Consortium investigators. The media, how- 
ever, continued to devote attention to the Perry 
results, which were a lot more interesting and 
understandable than the positive but more esoteric 
reports on Head Start that were appearing in the 
scientific literature. Eventually the Perry Project 
came to be described as "Head Start-like." and its 
findings became attributed to early intervention in 
general and sometimes to Head Start itself. 

The High/Scope program focused on pre- 
school education, although there was a home 
visitation component to enable parents to rein- 
force the curriculum at home. Head Start also 
provides preschool education, but attempts to in- 
volve parents in many more ways and to deliver 
comprehensive services to children as well as their 
families. The Perry Project is therefore not at all 
“Head Start-like." And whether or not the two 
programs produce the same results is unknown. 
There has never been a report on Head Start's 
effects on college attendance or welfare usage, for 
example, and no one has ever conducted a cost- 
benefit analysis of the project (Haskins, 1989). 
Just considering the value of parental employment 
and preventive health care. Head Start may be 
more cost-effective than High/Scope. This is an 
empirical issue that must be addressed before 
comparative judgments of the two programs can 
be fairly made. 

Despite the lack of comprehensive, longi- 
tudinal data on Head Start, many reviewers of the 
early intervention literature have accepted the 



assumption that Head Start graduates generally do 
not do as well as those who attended other, usually 
university-based programs (Haskins. 1989; 
Kotelchuck& Richmond, 1987; Woodhead, 1988). 
In defense of Head Start, some have pointed out 
that the smaller projects were very carefully de- 
signed and implemented, with highly trained staff 
and substantial budgets. The annual per-child cost 
of the Perry Preschool, for example, was esti- 
mated to be $7,600 in 1992 dollars (Barnett, 
1993) — more than twice as much as Head Start 
spends. There is no way the national program 
could provide each child with more services with 
less than half the money. Further, the experimen- 
tal preschools typically operated in single sites 
under constant monitoring to ensure that the pro- 
grams were delivered in the manner intended. 
Head Start operates in nearly 1,400 different 
places where it is tailored to local circumstances 
and quality ranges from excellent to poor. 
Besharov has been particularly critical of the com- 
mon assumption that Head Start produces the 
same benefits as other programs precisely because 
they do not have the same level of quality. Assert- 
ing that “not all preschool programs are created 
equal,” Besharov believes that “Head Start has 
serious problems . . . that often prevent it from 
making a lasting impact on disadvantaged chil- 
dren” ( 1 992, pp. 52 1 , 520). 

Because the experimental preschools fo- 
cused almost exclusively on preschool education, 
the debate over generalizability of the evidence 
actually centers on the quality of the educational 
experience. The smaller programs typically con- 
tained an excellent curriculum delivered by well- 
trained teachers — features not found in all Head 
Start programs. The original planners purpose- 
fully avoided prescribing a national curriculum, 
so the quality of the educational component in 
each Head Start center essentially depends on 
local talent. It seems logical to assume that if all 
Head Start programs matched the educational 
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standards of. for example, the Perry Preschool 
(including teacher qualifications and compensa- 
tion), the same results would be achieved. 

We would carry this assumption a step 
further and argue that with an improved preschool 
education component. Head Start’ s benefits would 
surpass those of the Perry Project. In addition to 
education. Head Start contains the other elements 
that many theorists and much empirical data asso- 
ciate with effective intervention: attention to the 
needs of the whole child, attention to the needs of 
the family through support services, and the in- 
volvement of parents, who are the child’s first and 
most influential teachers (see National Head Start 
Association, 1990; Price, Cowen, Lorion, & 
Ramos-McKay, 1988; Schorr, 1988; Zigler & 
Berman, 1983; Zigler &Styfco, 1993). Even with- 
out this scientific support, it is logical to assume 
that a program that delivers quality educational 
and other needed services to children as well as 
their families will produce more benefits than one 
that provides a good education alone. 

A question that begs answering is why the 
Perry preschoolers benefited from a program that 
had such a narrow focus when the knowledge base 
indicates that broader services are required. The 
High/Scope investigators believe that their sound 
curriculum allowed children to enter school with 
better academic skills and attitudes, which elicited 
positive reactions from their teachers; these fur- 
ther enhanced their performance and led to better 
student-teacher interactions in subsequent grades 
(Berrueta-Clement et al„ 1984). This “snowball" 
hypothesis is consistent with Woodhead's sug- 
gestion that children who have quality preschool 
experiences are “better able to cope with the 
demands of schooling at a critical time when their 
identities within the education system [are] being 
established,’’ triggering “a more positive cycle of 
achievement and expectation" (1988, p. 448). 

An alternative explanation centers on parents. 



who also have a critical role in helping their 
children adjust to school. As a result of 1 to 2 years 
of weekly home visits, the Perry parents may have 
become more interested and involved in their 
children’s schooling (an outcome established for 
other programs, e.g., Hauser-Cram, Pierson. 
Walker, & Tivnan, 1991; Pfannenstiel, 1989). 
These attitudes, present at the sensitive time of 
transition to elementary school, could be what 
triggered the positive cycle of achievement. (See 
Hale. Seitz, & Zigler, 1990, and Seitz, 1990, for 
discussions that the High/Scope effects may be 
due to parents.) 

Anotherquestion raised by the High/Scope 
results concerns the expectations we should have 
of early intervention. Recently, preschool educa- 
tion has been widely heralded as a means to end 
crime and welfare dependency. This ridiculous 
claim was certainly fueled by the publicity given 
the Perry findings. Yet the High/Scope investiga- 
tors have pointed out that not all of their former 
students had good outcomes. Over half of the 
Perry graduates were arrested at least once by the 
time they were 27 years old, and 44% of the 
program females had been teen mothers 
(Schweinhart et al„ 1993). Longitudinal studies 
of other preschool projects have also shown that 
although children do better than they would have 
without the experience, they still do not approach 
the achievements of middle-class students (e.g.. 
Fuerst & Fuerst, 1993; Hebbeler, 1985). 

Such findings lead to the sobering conclu- 
sion that early childhood intervention alone can- 
not transform lives. Its positive effects can be 
overpowered by the longer and larger experience 
of growing up in poverty. Quality preschool edu- 
cation such as that provided in the Perry Project 
may prepare poor children for school and help 
them meet social expectancies during the school 
years and perhaps beyond. Quality, comprehen- 
sive services such as those delivered in many Head 




Start centers might achieve these and. as we are 
about to discuss, other important gains. But no 
program can enable children to develop optimally 
when their larger rearing environment is not con- 
ducive to optimal development. Early interven- 
tion can help, but it must not be considered more 
than one piece of the solution to the problem of 
poverty. 

Lessons Yet to Learn from Head Start 

Although the Perry Project was better than 
Head Start in terms of quality of preschool educa- 
tion (and in evaluation), in other respects Head 
Start is the superior program. The knowledge base 
clearly supports the value of Head Start's compre- 
hensive, family-centered approach. Ironically, al- 
though the range of services provided in the Perry 
program was relatively narrow, the outcome mea- 
sures used were broad, revealing the variety of 
behaviors affected by intervention. Of Head 
Start's many components, evaluators have de- 
voted the most attention to the preschool aspect 
and its long-term effects on IQ and achievement. 
Possible benefits to physical health, nutritional 
status, social interactions, parents’ child-rearing 
abilities, family functioning, parental empower- 
ment, and community responsiveness have 
been underevaluated and undervalued. 

The area in which Head Start has perhaps 
had its strongest impact is in physical health and 
well-being. Program performance standards re- 
quire that all enrolled children receive medical 
screenings, immunizations, dental exams, and 
corrective treatment if needed. A high percentage 
of them do (Brush. Gaidurgis. & Best, 1993; 
National Head Start Association, 1990; Zigler, 
Piotrkowski. & Collins, in press), making Head 
Start a major provider of health services to poor 
children. This role is now expanding, as the project 
has recently been authorized to extend health care 
to participants' siblings. Head Start is also becom- 



ing a major supplier of screening and diagnostic 
testing services (EPSDT) required for children 
covered under Medicaid. In addition. Head Start 
delivers many of the services to preschoolers with 
disabilities and their families, as mandated under 
the Individuals with Disabilities Education Act 
(PL 99-457). In fact, children with disabilities 
comprise over 13% of Head Start’s enrollment 
(Administration on Children, Youth and Families, 
1993), so the program has done well in meeting 
the needs of this underserved population. An- 
other health benefit derives from the nutritious 
meals and snacks served at each center. 

Although Head Start’s health benefits may 
seem obvious, little research has been done to 
document these effects. One study reported that 
Head Starters were just as likely as middle-class 
children to have inoculations and physical exams, 
and more likely to have seen a dentist (Hale et al., 
1990). Similar physical exam histories were also 
found for the two groups’ siblings. Despite the 
absence of more hard evidence, the value of Head 
Start's health services is really not in question. 
Each year hundreds of thousands of poor children 
receive some medical attention and nutritious food 
they otherwise might not have had. As the Clinton 
Administration searches for solutions to inad- 
equate health care and dismal inoculation and 
hunger rates among children, the Head Start 
model may offer some guidance. 

Head Start's positive impact on families is 
another outcome that has not received the atten- 
tion it merits. In 1992. 94% of enrolled families 
received needed support services through the pro- 
gram (Brush et al.. 1993). Each year thousands of 
low-income parents benefit financially by receiv- 
ing jobs and job training through Head Start. Over 
35% of the staff are parents of enrolled children or 
graduates (Brush et al., 1993), and many of them 
have earned Child Development Associate (CDA) 
credentials and have entered careers in early child- 
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hood education (Collins, 1 990). Parents have also 
gained on a personal level, reporting improved 
relationships with their children (National Head 
Start Association. 1990) arid greater life satisfac- 
tion and psychological well-being resulting from 
the supportive social network of the preschool 
community (Parker, Piotrkowski. & Peay, 1987). 

Another possible effect on families is that 
siblings benefit from their parents' involvement in 
Head Start and the support services the family 
receives, as has been found for other interventions 
(reviewed by Seitz & Apfel. in press). It certainly 
makes sense that if parents are encouraged to be 
actively engaged in their children's education, and 
if they have help resolving at least some personal 
or practical problems, they may become better 
socializers of all their children. If so, the standard 
empirical approach that evaluates only the target 
child may greatly underestimate the benefits of 
Head Start. 

Another underresearched area is the im- 
pact of Head Start on local communities. A few 
studies (reviewed by McKey et al.. 1985) have 
shown that the presence of a Head Start program 
enhances a community's capacity to meet local 
needs. In one survey, investigators identified al- 
most 1 ,500 institutional changes in the health care 
and educational systems in 48 communities after 
Head Start centers opened ( kirschner Associates, 
1970). A more recent General Accounting Office 
(1992) report praised Head Start's methods of 
linking families with local services, judging this 
approach far more successful than efforts to create 
new services or delivery mechanisms. The GAO 
extended uncharacteristic approval by suggesting 
that “Congress may wish to consider promoting 
service-oriented efforts like Head Start" (p. 6). 

Clearly. Head Start has the potential to 
enhance many systems that influence the course 
of a child's development. Judgments of the pro- 
gram based solely on academic indicators are 



therefore simplistic. To get a true picture of Head 
Start’s effects on developmental outcomes, a 
broader empirical approach is required. 

Policy Directions: Issues of 
Quantity and Quality 

Despite its limitations, the literature on 
early childhood intervention points in two clear 
directions: to improve the life outcomes of chil- 
dren in poverty, programs must be more sustained 
and of high quality. We have learned that there are 
no magical periods of development when children 
can be inoculated against the ravages of social and 
economic disadvantage. Every period of develop- 
ment is important and requires appropriate envi- 
ronmental nutrients. Thus, to be more effective, 
intervention should begin earlier and last longer 
than the year or 2 before the child enters school. 
Further, not just any program will do. Only those 
that provide good services can be expected to 
achieve consistently good results. These lessons 
have important implications for the future of Head 
Start. 

Dovetailed Programming 

When Head Start began, a widely held 
belief was that a quick dose of environmental 
enrichment would immunize children against the 
effects of poverty. Fortunately, Head Start's plan- 
ners did not subscribe to the inoculation model. 
They knew that no program could do much for a 
child in the course of one summer, which was the 
duration of Head Start in its first incarnation. Not 
only was the program extended to last for a school 
year, but planning got underway to serve children 
both before and after the preschool stage. Just 2 
years after Head Start began, the first Parent and 
Child Centers (PCCs) were opened to provide 
supportive services and parent education to fami- 
lies and children from birth to age 3. That same 
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year. Follow Through was initiated to continue 
Head Start services through the early grades of 
elementary school. Plans were for this to be a 
national project of the same scope as Head Start, 
but the escalating costs of the Vietnam War de- 
pleted the expected funding before the demonstra- 
tion phase was completed (see Doernberger & 
Zigler. 1993). The concept survives as a tiny 
experiment in planned curriculum variation oper- 
ating in less than 40 sites across the country. 
«Sadly, after nearly 30 years, both of these efforts 
to extend Head Start have remained undeveloped 
and their potential gone unrealized. 

There are a few encouraging signs. The 
number of PCCs has been expanded from 36 in 
1989 to 106 today. The centers still operate with- 
out consistent performance standards, however, 
so there is no way of knowing what services are 
generally delivered or how well they meet the 
needs of participating families. 

The first author has been working with the 
Carnegie Task Force on Meeting the Needs ol 
Young Children to advise the Clinton administra- 
tion on planning a 0 to 3 Head Start, possibly 
housed in existing PCCs. The exact details of the 
program would be worked out by a planning 
committee of experts from various disciplines 
similar to that which developed the current Head 
Start. Services for at-risk families would begin 
prenatal ly and include nutrition, health care, 
parenting education, and family support. (A child 
care component is also being suggested because of 
the dearth of quality infant care despite the grow- 
ing need.) The rationale for the project is clear: 
waiting until a child is 3 or 4 years old is waiting 
too long. Preventive services can be more effec- 
tive than remedial ones. Children who are healthy, 
have sound rela'ionships with their primary 
caregivers, and who have received adequate nur- 
turing and stimulation will have the socioemotional 
foundations needed for learning in preschool and 
beyond (Zero to Three. 1992). 



A common complaint about Head Start is 
that graduates soon lose the academic and cogni- 
tive gains they made in preschool. This criticism is 
more appropriately directed toward the elemen- 
tary schools they attend, for this is where the 
preschool advantage fades. To continue the mo- 
mentum toward success, there must be a smooth 
transition from the preschool to the school envi- 
ronment and coordinated programming that builds 
upon prior learning. 

The Head Start-Public School Early Child- 
hood Transition Project (described by Kennedy, 
1993) is the latest attempt to continue services in 
grade school. The project, which was legislated in 
the Human Services Reauthorization Act of 1990 
and is now operating in 32 demonstration sites, 
follows Head Start graduates from kindergarten 
through grade 3. Local Head Start and public 
school personnel work to introduce each child and 
family to the new school experience and to famil- 
iarize kindergarten teachers with the child’s 
progress, program, and needs. Comprehensive 
services, parental involvement, and family sup- 
port are continued for the next 4 years. Early 
childhood educators often express skepticism 
about linking preschools and schools because they 
fear there will be a downward extension of formal 
educational practices that are not appropriate for 
very young children. In the Transition Project, 
curricula and pedagogies must be coordinated 
between the two levels of schooling. This should 
mean that the individualized, developmental na- 
ture of the preschool program will be extended 
upward, making the two school experiences less 
fragmented for young learners. 

A small but convincing body of evidence 
supports the premise of the Transition Project, i.e.. 
that longer, coordinated intervention produces 
longer lasting gains. When a very intensive inter- 
vention, the Abecedarian Project ( Horacek, Ramey. 
Campbell. Hoffmann, & Fletcher, 1987), was fol- 
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lowed by a school-age program through second 
grade, children continued to do better than peers 
who had not participated; classmates who experi- 
enced only the early intervention no longer had a 
statistically significant advantage over controls 
(Ramey & Campbell, 1991; Seitz, 1990), and 
those who received only the school-age program 
did not benefit substantially. An intensive reading 
curriculum in Success for All resulted in substan- 
tial gains for children who began the program in 
preschool, kindergarten, or first grade, and these 
gains magnified each year through third grade 
( Madden. Slavin. Karweit. Dolan. & Wasik, 1991 ). 
The results were much less pronounced for chil- 
dren who encountered the new curriculum after 
first grade, indicating that continuity from one 
reading level to the next was important for build- 
ing program effects. 

A program more similar to Head Start, the 
Chicago Child-Parent Centers, provided compre- 
hensive services and required parental involve- 
ment for 1 to 2 years of preschool, kindergarten, 
and from 1 to 3 years of elementary school 
(Reynolds, 1992). Children who attended the pre- 
school and kindergarten did not differ from con- 
trols by the time they were in fifth grade. However, 
fifth graders who had received 4 to 5 years of the 
intervention (preschool through second grade) 
had higher achievement scores and less grade 
retention, with no fade-out evident. Reynolds con- 
cluded that the transition between kindergarten 
and the primary grades is a crucial time, and that 
at least 2 years of follow-up programming in 
elementary school are required to sustain benefits. 

A few studies suggest that the gains pro- 
duced by extended intervention can last into high 
school and beyond. Initially. Abelson. Zigler. and 
DcBlasi ( 1974) found that Head Start graduates in 
their fourth year of Follow Through did better on 
IQ. achievement, and social-motivational mea- 
sures than did their preschool classmates who 



attended traditional school programs. A follow-up 
showed these graduates maintained theirsuperior- 
ity in several areas through grade 9 (Seitz., Apfel, 
Rosenbaum, & Zigler, 1983). In another study of 
the Chicago Child-Parent Centers, children who 
attended for 4 to 6 or more years had better high 
school graduation rates. For the overall sample, 
62% received diplomas compared to about 49% of 
controls (Fuerst & Fuerst, 1993). Among students 
who had 7 to 9 years of intervention, 85% of girls 
and 70% of boys graduated. Center students who 
eventually dropped out did so 18 months later than 
other dropouts in the school system. Finally, a 
follow-up of children who had participated in the 
Deutschs' early enrichment program from pre- 
school through third grade showed that by young 
adulthood, the boys had higher education and 
employment status and some stronger academic 
skills than a control sample (Jordan, Grallo, 
Deutsch, & Deutsch, 1985). The researchers at- 
tributed the loss of the advantage for girls to a 
more difficult transition to the regular fourth- 
grade classes, where their active learning orienta- 
tions were not harmonious with traditional sex- 
role expectations. 

This research has convinced us that the 
Transition Project will be effective in helping 
Head Start graduates maintain their preschool 
advantage. If evaluation of the demonstration 
phase confirms this expectation, it will be com- 
pelling to move the project into the educational 
mainstream. We have developed a plan to do so 
using current federal education expenditures 
(Zigler & Styfco, 1993). A large part of the De- 
partment of Education's budget (over $6 billion 
annually) is spent on Chapter 1 of the Elementary 
and Secondary Education Act of 1965 (ESEA). 
Chapter 1 is a compensatory education program 
for economically and "educationally” deprived 
children in preschool through grade 1 2. Originally 
intended to enhance the educational services of 
impoverished school districts, the program now 
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operates in the majority of the nation’s schools — 
mostly as a pull-out program offering remedial 
instruction to children who have fallen behind the 
academic expectations of their grade level. 

There has been little evaluation of Chapter 
I considering the size of the program, but what 
there is shows that most students do not exhibit a 
meaningful gain in achievement (see Arroyo & 
Zigler. 1993). The reasons for the lackluster re- 
sults may be in the program’s design: services are 
remedial rather than preventive and narrow in- 
stead of comprehensive; parental involvement is 
minimal; and health and family problems that can 
interfere with school performance are not ad- 
dressed. The ESEA will be reauthorized in 1993, 
and suggestions to revamp the program have fo- 
cused on training teachers and narrowing the tar- 
get population (Commission on Chapter I. 1992). 
Although such efforts may do some good, they do 
not bring the elements of effective intervention to 
Chapter I . To make a difference in the education 
of low-income children, we must put aside the 
ineffectual educational model of Chapter 1 and 
adopt on a large scale the proven model of compre- 
hensive. family-focused services. 

Our proposal is for Chapter I to follow 
Transition program plans and become the school- 
age version of Head Start. As Head Start eventu- 
ally expands to serve all eligible children. Chapter 
I can continue their intervention in grammar 
school. Coordinated curricula and continued pa- 
rental involvement and comprehensive services 
will then be firmly placed in schools that sene 
populations below the poverty line. On another 
level. Head Start and the Chapter I Transition will 
be two parts of a coherent federal policy to meet 
the needs of poorchildren beginning in preschool. 
Based on a solid knowledge base and big enough 
to have an impact, this new face for Chapter I 
holds promise for halting the lade-out of the Head 
Start advantage. 



Renewed Emphasis on Quality 

The studies involved in the Consortium, 
including the Perry Preschool, and others that 
were carefully designed, implemented, and evalu- 
ated (e.g.. Price et al., 1988). prove the obvious: 
only good programs produce good results. For 
example, a review of early intervention projects 
led Weikart and Schweinhart to conclude that 
"only high quality programs consistently show 
success” ( 1 99 1 , p. 58). This fact may explain why 
some studies of Head Start show lasting benefits 
while others do not. One criticism of the Head 
Start Synthesis Project ( Me Key etal., 1 985), which 
confirmed the fade-out observation, is that the 
analysts paid no attention to the differences in the 
Head Start programs in the studies they synthe- 
sized (Gamble & Zigler. 1989). Findings from 
studies of excellent programs were combined with 
those of poorer programs, diluting what may pos- 
sibly have been robust effects for the former. 

Quality problems are not new but have 
plagued Head Start since its hasty beginnings. In 
a matter of a few months, the project was trans- 
formed from an idea before the planning commit- 
tee to a national summer program serving over 
one-half million children. Starting off so big and 
so fast left quality controls behind, and the pro- 
gram has been trying to catch up ever since. 
Program performance standards were not imple- 
mented until 1975. They dictate what services 
each program must provide and remain the princi- 
pal vehicle for monitoring quality. The regional 
offices have primary responsibility for assuring 
that grantees adhere to the standards, but regional 
staffing has declined considerably over the years 
even as Head Start has become larger. One former 
regional director told us that his oversight staff 
decreased from al least 50 persons in 1 978 to 1 2 in 
1992. and travel money for site visits declined 
more than 909} . His experiences arc not atypical; 
nationwide only one in five centers was monitored 
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in 1988 (Chafel. 1992). although this level im- 
proved to 35% in 1992 (Horn. 1993). The best 
standards in the world are meaningless if they are 
not enforced. 

Over the years many experts have studied 
the problems in Head Start and proposed thought- 
ful solutions. For the project's 15th anniversary 
the first author chaired a major advisory commit- 
tee convened to chart future directions (U.S. De- 
partment of Health and Human Services, 1980). 
The report stressed areas where improvement was 
needed and made recommendations for corrective 
action. The committee's successor on the 25th 
anniversary, the Silver Ribbon Panel, found that 
many of the same problems were still there and 
had worsened over time (National Head Start 
Association. 1 990). Other thorough analyses have 
similarly concluded that upgrading quality must 
be a top priority (e.g., Chafel. 1992; Washington 
& Oyemade, 1987; Zigler, Styfco, & Gilman, 
1993). 

Of all the program components, parental 
involvement and family support have been the 
most neglected. In many programs there are no 
coordinators for either component (or for health), 
and these positions are often combined with others 
because personnel budgets have fallen short of 
inflation (National Head Start Association, 1990). 
In 1991-92, social service staff had average 
caseloads higher than 94 children — more than 
three times the number recommended (Inspector 
General. 1993a). Between 1987 and 1991. train- 
ing institutes were held in every area but parental 
involvement (National Head Start Association, 
1991 ) — a void not addressed until 1993. Lack of 
attention to this component is particularly worri- 
some not only because participation has such 
potential benefit to parents (Nelson, 1991), but 
because they can support the intervention goals 
long after the formal program. 



The biggest threat to Head Start quality is 
posed by staffing problems. Caregiver stability and 
level of skill have been identified as essential to 
quality care and directly responsible for positive 
developmental outcomes in children. In the Na- 
tional Child Care Staffing Study, quality of care 
was linked to staff wages and turnover rates 
( Whitebook, Howes, & Phillips, 1 989). Low wages 
led to high turnover, disrupting continuity in the 
child’s experiences and harming social and verbal 
development. In a nationwide survey of Head Start 
centers in 1 988, Collins ( 1990) reported that teach- 
ers received an average salary of about $12,000, 
only 6 1 % of the average compensation of teachers 
in public schools. In 1 992, starting salaries for Head 
Start teachers averaged only $ 1 2,077 (Brush et al„ 
1993), about half the public school average. De- 
spite poor salaries. Head Start has been spared from 
high attrition apparently because staff are commit- 
ted to their jobs. Collins found annual turnover in 
Head Start to be about 25%, considerably lower 
than that in other early care and education pro- 
grams. Yet a more recent report showed that in 
Head Start centers that do experience turnover, the 
rate is higher than 60% (Kisker, Hofferth, Phillips, 
& Farquhar, 1991). Quality care cannot be pro- 
vided under such conditions. 

Critics are quick to note that funding could 
not possibly be the cause of Head Start' s problems 
because its budget has skyrocketed, jumping from 
$ 1 .2 billion in 1 989 to $2.8 billion in 1 993. But if 
we look at the numbers prior to the recent expan- 
sion increases, we can see why quality has some- 
times been compromised. Head Start barely held 
its funding level until the positive results of the 
Consortium studies were released. Then the project 
received its first substantial budget increase and 
was placed in a “safety net” where it would be 
spared spending reductions the Reagan Adminis- 
tration imposed on most federal programs. How- 
ever, cuts in federal food and other programs that 
centers relied upon began to erode their ability to 
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maintain services even at previous levels (Wash- 
ington, 1985). Despite increasing appropriations, 
the average expenditure per child fell by more than 
$400 in constant dollars, or 13% in inflation- 
adjusted dollars, between 1981 and l989(Rovner, 
1990). It is no wonder that a recent Inspector 
General report ( 1993a) showed that in some Head 
Start programs, participants do not receive even 
basic services such as all their immunizations. 
Despite questions about the criteria used in the 
report, and indeed whether the centers surveyed 
were representative, there is no doubt that some 
Head Start centers deliver inadequate services. 

Years of insufficient funding are of course 
not the only reason Head Start has been struggling 
to maintain quality. The population it serves and 
their needs have changed. America is a different 
place than it was in 1965. The face of poverty has 
grown uglier. The environments where poor chil- 
dren are raised now involve more homelessness, 
street violence, illegal drugs, and young, single- 
parent families. Children who were born with the 
HIV virus or exposed to crack are now showing up 
in Head Start centers. The number of dysfunc- 
tional families with myriad needs has multiplied. 
The extra attention required by these children and 
families has burdened already overworked staff 
and undoubtedly deprived some participants of 
the individual help they need (Inspector General, 
1989, 1993b). 

Quality can no longer be threatened or 
overlooked, especially now that Head Start is 
serving increasing numbers of children and fami- 
lies. The Silver Ribbon Panel warned that “Pro- 
gram expansion should never occur at the expense 
of quality” (National Head Start Association, 1990, 
p. 35). Supporting their conclusion, the first au- 
thor testified before the U.S. Senate ( 1990, p. 49) 
that “Head Start is effective only when quality is 
high. . . . Below a certain threshold of quality, the 
program is useless, a waste of money regardless of 



how many children are enrolled.” Policymakers 
have good reasons to want to open Head Start to 
every eligible child, but they must assure each of 
them a good Head Start experience. 

Planning for a Better, Bigger Program 

Lawmakers heeded the advice of experts 
when they wrote a quality set-aside into the Hu- 
man Services Reauthorization Act of 1990. After 
adjustments for inflation, 25% of expansion funds 
are to be used for improvements. Half of the set- 
aside is reserved for increased salaries and ben- 
efits. The rest is marked for training and technical 
assistance, facility improvements, and transporta- 
tion. The Act also requires that by 1994, each 
center is to have at least one teacher with a CDA 
credential. While a step in the right direction, this 
mandate shows how far Head Start has fallen 
behind over the years and how far it has to go to 
catch up with current professional practices. Few 
middle-class parents would enroll theirchildren in 
a preschool program where the teachers did not 
have college degrees and certification in early 
childhood education. Head Start is supposed to be 
a model program and should also have qualified 
teachers who are paid a worthy wage. 

Although the plans to improve Head Start 
are good ones, they cannot be implemented over- 
night. Training staff and locating facilities, for 
example, can be very time consuming. But pro- 
gram directors are also under a great deal of 
pressure to serve more children, which can drain 
the energies needed for trying to serve them better. 

Another I nspector General ’ s report ( 1 993b ) 
and discussions with program directors tell the 
story. In some areas Head Start centers have been 
unable to recruit all the children they are supposed 
to enroll with expansion monies. A widespread 
problem is the inability to find qualified staff 
because they are either unavailable or will not hire 



on at the low pay levels. Space is a major problem, 
with some centers having to operate with double 
sessions because new classrooms cannot be found. 
This practice reduces time for planning daily pro- 
grams and attending to family needs. Many pro- 
grams have had a difficult time raising the local 
match to their sudden budget increases. 

Expansion is clearly proceeding without a 
plan. The effort is of such magnitude and impor- 
tance that one must be implemented quickly so 
that growth does not threaten efforts to enhance 
quality. Thankfully, both the White House and 
Donna Shalala. Secretary of Health and Human 
Services (HHS), have voiced a commitment to 
thoughtful growth and improving quality. Shalala 
appointed the Advisory Committee on Head Start 
Quality and Expansion, charged with recommend- 
ing the priorities and pace for expansion. Their 
work also focused on short- and long-term plans to 
strengthen services. The Committee's recommen- 
dations, now in various stages of review and 
implementation within Congress and the agencies 
responsible for Head Start, have at the very least 
begun the planning process in earnest. 

Although much attention is now being 
devoted to poor service delivery in some Head 
Start centers, expansion and improvement efforts 
must apply to administration as well. Some ad- 
ministrative responsibilities have been scattered 
throughout HHS and should be brought back to 
the central Head Start office. For example, staff- 
ing for health care was moved to the Maternal and 
Child Health Division, leaving a void in leader- 
ship for this vital program component. Regional 
Head Start personnel no longer report directly to 
Head Start's top managers, so there is not a clear 
managerial relationship between the national 
headquarters and those who are responsible for 
carrying out the program. And. as noted earlier, 
regional staffing levels have been greatly reduced 
over the years, so there are fewer people to oversee 
more grantees. To have a better Head Start pro- 



gram. we must have a well-managed one that is 
sufficiently staffed to accomplish its charge. 

Plans to improve Head Start must also 
address the future. The project’s success rests on 
its capacity to meet the needs of impoverished 
children and their families; as their needs change, 
so must the program. The Advisory' Panel for the 
Head Start Evaluation Design Project (1990) pre- 
sented a blueprint for needed research to guide the 
development of better services and delivery 
mechanisms. Their recommendations focused on 
studying which program services work best and 
for whom. This overriding question suggests sev- 
eral circumscribed ones: What mediates program 
effects? Are 2 years of Head Start more or less 
cost-effective than 1 ? Do all participants need all 
program services? 

Another question — one that can no longer 
be answered by generalizing from the findings of 
other programs — is just how wide-ranging and 
lasting are Head Start's benefits? In 1990 Con- 
gress entertained plans to conduct a 20-year 
longitudinal study to include competence indica- 
tors such as rates of criminality, welfare use, and 
employment status of Head Start graduates and 
their families. Unfortunately, this much needed 
work was never funded. Hopefully the current 
controversy over the value of Head Start, raging at 
a time when substantial monies are being granted 
for expansion, will resurrect the commitment to 
research. 

This is not an empty hope. After 30 years 
of experience with early intervention, good ideas 
and plans for empirical study have been devel- 
oped. A committee charged with implementing 
the Design Project's blueprint is now in session. 
And Secretary Shalala has repeatedly acknowl- 
edged the need for scientific guidance to enhance 
program effectiveness. Head Start is finally mov- 
ing toward long-awaited expansion and improve- 
ment of service quality. These efforts must be 
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anchored in the best knowledge available if Head 

Start is to fulfill its promise of improving the lives 

of children and families in poverty. 
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On August 12. 1983. Judy Johnson, the mother 
of a toddler at a prestigious nursery school in 
Manhattan Beach. California, told police that her 
2-year-old son had been molested by Raymond 
Buckey. a teacher and the grandson of the school's 
founder. Virginia McMartin. Buckey was arrested 
but subsequently released for lack of evidence. On 
March 22. 1984. he was indicted by a grand jury 
and rearrested along with six female teachers, 
including his mother. Peggy McMartin Buckey. 
He was held without bail until 1989. 

In August. 1984. the first of many prelimi- 
nary hearings had begun for the seven McMartin 
Preschool defendants. Fourteen former student sat 
the nursery school took the witness stand at this 
hearing and described a series of bizarre events 
involving sexual abuse, satanic rituals, and animal 
mutilation that allegedly occurred at their pre- 
school. Based on these children's testimony, the 
defendants were accused of 1 13 counts ol abuse, 
later expanded to 321 counts, including rape, 
sodomy, fondling, oral copulation . and drugging. 



and photographing of at least 100 children in the 
nude. 

In January 1986. after 17 months of prelimi- 
nary hearings in which each child witness was 
cross-examined by each of seven different de- 
fense attorneys, charges against five of the six 
women were dropped because of insufficient evi- 
dence. Only 26-year-old Raymond Buckey and 
his 38-year-old mother. Peggy McMartin Buckey. 
remained as defendants. After spending 2 years in 
jail. Peggy McMartin Buckey's bail was set at 
$300,000 in 1986. while her son remained re- 
manded without bail in a proceeding that would 
last an additional 4 years. 

In the ensuing years of legal proceedings, the 
major issue before the court w-as whether or not to 
believe the children. On the one hand, it was 
argued that the children's reports were authentic 
and that their bizarre and chilling accounts of 
events, which were well beyond the realm of most 
preschoolers' knowledge and experience, only 
served to substantiate the fact that the children had 




actually participated in them. On the other hand, it 
was argued that the children's reports were the 
product of repeated suggestive interviews by par- 
ents. law enforcement officials, and therapists, 
and that the children were only reporting events 
suggested to them during these interviews. 

In January 1990. following a 33-month trial 
and 9 weeks of deliberation, the jury in the 
McMartin Preschool sexual abuse case returned 
“not guilty" verdicts on 32 of the 63 counts. The 
jury deadloekedon 1 2 molestation charges against 
Raymond Buckey and on I count of conspiracy 
against his mother. Judge William Pounders dis- 
missed the conspiracy charge. 

In response to the acquittals, the children’s 
parents railed at the way the case had been handled 
and at the jury's verdicts. The children themselves 
appeared on nationally televised talk shows, weep- 
ing over the jury’s seeming refusal to believe their 
claims of ritualistic abuse. Newspapers and maga- 
zines across the nation ran headlines such as: 
"Doubt the children and jail the parents!" 

During a postverdiet press conference, many 
jurors claimed that they believed that some of the 
children had been abused, but were unable to 
reach a guilty verdict because of the suggestive 
way the children had been interviewed. These 
jurors claimed that the social workers prevented 
the children from speaking in their own language 
and thus diminished their credibility. 1 

By January 1 990. the prosecution announced 
Buckey would be retried on the remaining 12 
charges on which the jurors had been deadlocked. 
Although this trial was shorter, the jury again 
deadlocked. When the judge declared a mistrial, 
the prosecution did not to retry the case. 

The McMartin case made legal history. In 
sheer magnitude, it was without parallel, lasting 7 
years from the time of its inception to the final 
verdicts, producing hundreds of thousands of pages 



of transcript, and costing the State of California 
overS 1 6 million. From the very start, the McMartin 
case captured the attention of the national media, 
with regular accounts of the children's allegations 
appearing on television (e.g.. ABC News Nightlhw. 
April 20. 1984). in newspapers (e.g.. Churl icr& 
Downing. 1988: Shaw. 1990) and in magazine 
articles (e.g.. Fischer. 1988). 

The McMartin case is not a singular happen- 
ing. There have been many similar eases in North 
America and Europe, some of which have re- 
ceived extensive media attention. : For the most 
part, these cases share the following elements: 
First, the witnesses were preschoolers at the time 
of the alleged abuse. Second, the disclosures were 
not made immediately following the alleged event, 
but after a long delay. Third, the disclosures often 
were preceded by intensive interviewing of the 
children by various professionals (e.g.. child pro- 
tective service workers, law enforcement) and 
nonprofessionals (e.g.. parents, grandparents). 
Fourth, the children were the only witnesses to 
these alleged events, and corroborative physical 
evidence was lacking. Fifth, none of the defen- 
dants ever made a confession: all maintained their 
innocence, even after some co-defendants were 
convicted. Finally, the major issue before the jury 
in all of these cases was whether to believe the 
children. 

Researchers in child development have served 
as witnesses or consultants for the defense and/or 
the prosecution in all of these trials. More impor- 
tantly. these cases have changed the course of our 
research and thinking on children’s memory de- 
velopment. They have encouraged researchers to 
tackle new issues, to develop innovative experi- 
mental paradigms, and to challenge and elaborate 
previous research on the reliability of young 
children's statements. 

In this Social Policy Report we provide a 
glimpse into the social science research that has 
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accumulated on the aspect of children's testimony 
that figured so prominently in the McMartin case 
and hundreds of others like it.' namely, 
preschoolers' presumed suggestibility. We then 
present some tentative thoughts about the policy 
implications of this research, by addressing the 
questions: What can (and should) researchers and 
mental health professionals tell courts when they 
are called upon to serve as expert witnesses and 
consultants? What is the proper role for profes- 
sional organizations to play in overseeing expert 
testimony? First, however, we provide some back- 
ground information about the problem of child 
sexual abuse in the U.S. and about the history of 
children providing courtroom testimony. 

Prevalence of Abuse and Court Involvement 

Crime statistics reflecting the sexual abuse of 
children are of great social concern. These statis- 
tics come from two major sources: one is based on 
annual rates of "substantiated" or "indicated" re- 
ports of child abuse. 4 and the other is based on 
adults' reports of abusive events during their own 
childhoods. The first source thus provides esti- 
mates of incidence \ whereas the second provides 
estimates of prevalence. 

According to the most recent incidence fig- 
ures (based on data from 45 states), there were 2.7 
million reports of suspected child maltreatment in 
1991 (National Center for Child Abuse and Ne- 
glect. 1993); 129.697 of these were substantiated 
or indicated cases that were sexual in nature, 
indicating an incidence of childhood sexual abuse 
of less than I c /c of children for that year. The 
accuracy of these rates has been challenged, (e.g., 
Besharov, 1991 ; Robin, 1991 ). On the one hand, 
these figures may overestimate the extent of child 
sexual abuse because they include indicated and 
substantiated cases which are not validated. Oth- 
ers. however, argue that these rates may underes- 



timate the incidence of child sexual abuse, be- 
cause many cases of actual child sexual abuse end 
up being classified “unsubstantiated,” or signifi- 
cant numbers of cases of abuse are never reported 
to authorities. It seems plausible that while some 
substantiated cases are actually false, many more 
unsubstantiated and unreported cases are real. 
Thus, the national data likely underestimate the 
true incidence, although no one can say by how 
much. 

Random samples of adults, asked about their 
childhood history, yield highly variable estimates 
of childhood sexual abuse, ranging, for females, 
from 6.8% (Siegel, Sorenson, Golding, Burnam, 
& Stein, 1987) to 62% (Wyatt, 1985); and for 
males, from 3% to 31% (see Peters, Wyatt, & 
Finkelhor, 1986). A number of methodological 
factors may account for these discrepant figures, 
but review of these is beyond the scope of this 
article. Our point is that even if the lowest preva- 
lence rates are the most accurate, and even if the 
incidence of child sexual abuse in 1991 was 1%, 
this still represents a serious societal problem. 

As a result of society's reaction to these 
dramatic figures, and particularly in reaction to the 
ineffective prosecution of child abuse cases, the 
legal system has been forced to change some of its 
rules concerning the admissibility and treatment 
of child witnesses. For example, until recently 
courts of law in all English-speaking countries 
were reluctant to accept the uncorroborated state- 
ments of child witnesses (Chadbourn, 1 978). This 
reluctance was reflected in competency hearings, 
corroboration requirements, and cautionary in- 
structions that some judges gave to juries concern- 
ing the risk of convictions based solely on the 
testimony of child witnesses (Andrews. 1964; 
Cohen. 1975). During the 1980s, however, all but 
a few states dropped their corroboration require- 
ment for children in sexual abuse cases, a crime 
that bv its nature lacks corroboration. Seventeen 
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states now allow children to testify regardless of 
the nature of the crime, permitting the jury to 
determine how much weight to give to the child's 
testimony. With the continued adoption by states 
of Federal Rules of Evidence, the number of child 
witnesses is likely to expand. 

As more and more children have been admit- 
ted as witnesses in the courtroom, legal and court- 
room procedures have been modified. For ex- 
ample. some courts have instituted shield laws 
which permit a child witness to testify either 
behind a one-way screen or over closed-circuit 
television, to occlude the child's view of the 
defendant but not the defendant's view of the 
child. Hearsay exceptions are also allowed, 
whereby therapists, pediatricians, and others de- 
scribe what children have said to them. These 
measures serve to assist child witnesses who oth- 
erwise might be "psychologically unavailable” to 
testify in open court ( McGough. in press: Montoya. 
1992. 1993). 

In light of claims that many of these modifi- 
cations challenge the constitutional rights of de- 
fendants (Maryland v. Craig, 1990; Coy v. Iowa. 

1 988 ). it is important to determine whether such 
procedures do. in fact, facilitate the accuracy of 
children's testimony (Montoya. 1993). This is 
particularly important in light of the fact that 
recent court decisions regarding the treatment of 
child witnesses have not been predicated exclu- 
sively on humane issues (i.e.. reducing the stress 
placed on child witnesses), but also on the pre- 
sumption that courtroom modifications will in- 
crease testimonial accuracy (Harvard Law Re- 
view Notes. 1985; Montoya. 1992). No scientific 
data addressing these issues are yet available, 
although some data on the costs and benefits of 
courtroom innovations on children's courtroom 
behavior have been gathered ( Batterman-I -amice 
A Goodman. 1993; Min. 1993). 



There are. however, no reliable national data 
on the impact of these changes on the number of 
children who actually end up participating in fam- 
ily court or criminal justice proceedings. Gray's 
recent analysis ( 1993) of eight jurisdictions sug- 
gests that 39} to 109} of all cases of sexual abuse 
that are eventually filed with police result in atrial. 
Based on the National Center on Child Abuse and 
Neglect ( 1993) statistics cited above, this would 
suggest that, in the 45 states reporting, up to 
1 3.000 children testified in sexual abuse trials that 
year. For various reasons, however, the actual 
incidence of court involvement may be consider- 
ably higher. The majority of cases that end in pleas 
still require the child to be deposed even if the 
child does not testify in a court trial. Second, these 
figures do not include data from five states, one of 
which is New York where as many as 3.150 
children had formal court involvement in sexual 
abuse cases in 1990 (Doris, 1993). Finally, if 
nonsexual types of abuse and nonabuse cases 
involving children serving as witnesses arc in- 
cluded (e.g.. eases of domestic violence, custody 
disputes, accidents, playground injuries), then the 
estimate of children's participation in the legal 
system risesconsiderably. possibly to over lOO.OOO 
cases annually. We must emphasize, however, 
that this is. at best, an educated guess. 

One final point about children's court in- 
volvement is in order. It appears that preschoolers 
are disproportionately more likely to be abused 
and more likely to have theircaseseome to trial. In 
an analysis of a sample of nearly 800 alleged 
victims of child sexual abuse in New York, 
preschoolers (ages 6 and younger) accounted for 
nearl y 40G of the official sexual abuse cases, and 
28N were aged 5 and younger (Doris. 1993). In 
Gray's recent analysis ( 1993). children below the 
age of 8 accounted for 459} of sexual abuse cases, 
and 1 8' } were 5 years old or younger: 3 1 ( '< of the 
cases involv ing 5- to (wear-old children went to 
trial, and 24‘ < of cases involving 3- to4-yeai -olds 



went to trial, whereas only l() r /r of the cases 
involving 13- to 14-year-olds went to trial. 

Despite modifications in the judicial system 
resulting in the greater court involvement of chil- 
dren, both jurists and social scientists continue to 
raise fundamental questions about whether these 
changes actually facilitate the accuracy of 
children's testimony (Montoya, 1992; 1993) and. 
more broadly, whether the testimony that children 
do give is accurate. Next, we discuss the research 
that has been carried out on one important aspect 
of the accuracy of children's reports: the degree to 
which very young children are disproportionately 
prone to suggestion. 

Research on Children's Suggestibility: 

Past and Present Trends 

The scientific research on the suggestibility 
of children's recollections is both contradictory 
and confusing. A review of 20th-century studies 
of children's suggestibility can be found in Ceci & 
Bruek. 1993. Our purpose in this report is not to 
recap that analysis but to highlight some of the 
salient conclusions of these studies and to focus on 
the different experimental approaches used in 
investigating children's suggestibility. 

luirly Studies oJSnyycstihility 

Harly studies of children's suggestibility, with 
few exceptions, led to a jaundiced portrayal of 
children's proneness to suggestibility. Beginning 
with the early experiments of Binet and his Euro- 
pean colleagues (Binet. 1900; Lipmann. 1911; 
Stern. 1910; Varendonck. 1911). and concluding 
with empirical studies in the 1920s and 1930s 
(Messersehmidt. 1933; Otis. 1924; Sherman. 
1923). early researchers v iewed children as ex- 
tremely susceptible to leading questions and un- 
able to resist an interviewer's suggestions. M. R. 
Brow n ( 192b). a legal scholar, wrote; 



Create, if you will, an idea of what 
the child is to hear or see. and the 
child is very likely to see or hear 
what you desire, (p. 133) 

Although the conclusions of these early re- 
searchers were confirmed by studies conducted 
right up until the 1980s. modern researchers have 
been ambivalent about generalizing these results 
to the forensic arena for several reasons. First, 
despite the fact that there is great concern cur- 
rently about the reliability of preschoolers' re- 
ports. not one study in the first 80 years of this 
century included preschoolers. More recent re- 
search has begun to fill this void; since 1 980. over 
20 studies relevant to the issues of children's 
suggestibility have included a prescho-'l sample. 

A second and more important concern was 
that most of the previous studies involved children's 
recall of events that were forensically irrelevant. 
In most of this earlier literature, researchers exam- 
ined the influences of a single misleading sugges- 
tion or a leading question on children's reports of 
neutral, nonscript ed, and often uninteresting events 
that occurred in a laboratory setting. Although 
these results may be of importance for theoretical 
conceptualizations of the mechanisms that under- 
lie suggestibility effects and memory processes, 
they have limited practical and legal relevance to 
the reliability of the child witness. In many court 
cases, the allegations involve the child as partici- 
pant and not as bystander; they involve the child's 
recall of salient, rather than peripheral, events; 
they often involve repeated interviews which are 
highly suggestive; and they frequently involve 
emotionally charged and highly stressful events, 
such as sexual molestation. The earlier experi- 
ments of this century provide no clues as to the 
testimonial accuracy of children in such circum- 
stances. 

But how does a researcher conduct an ethi- 
cally acceptable experiment that mirrors the many 





conditions that are characteristic of the child vic- 
tim-witness? It would be unacceptable, for ex- 
ample. to determine whether an interviewer can 
successfully suggest to children with substanti- 
ated histories of abuse that the abuse had never 
taken place. Similarly, it would be unacceptable to 
determine if nonabused children will make allega- 
tions of sexual abuse after a highly suggestive 
interview. It is ethically impermissible to alter 
such fundamental aspects of young children's 
autobiography. 

Modern Studies of Children 's Suggestibility 

In the past several years, a number of re- 
searchers have attempted to deal with these issues 
by developing new paradigms which admittedly 
do not mirror all of the conditions that bring 
children to court, but which do contain some 
important elements of the child witness's experi- 
ences. This section describes three major lines of 
recent research, each of which illustrates a differ- 
ent paradigm: (I ) increasing the salience of the 
experienced events about which children will be 
interviewed. (2) increasing the dynamics of the 
interview situation, and (3) adding anatomically 
correct dolls to the interviewing context. 

Increasing the salience of events. s dis- 
cussed above, earlier studies were critici: J as not 
forensically relevant because they did not exam- 
ine how chi Idren respond to questions about events 
that involved their own body, or about other sa- 
lient events that occurred in personally experi- 
enced and stressful situations. In response, a num- 
ber of researchers have designed studies in w hich 
children are asked misleading questions about 
being touched. In some studies, children are ques- 
tioned about their previous interactions with an 
experimenter in a laboratory (e.g., Rudy & 
Goodman. 1991). In other studies, children arc 
questioned about an inoculation (Goodman. 
Ilirschman. Ilepps, & Rudy. 1991) or a genital 



examination (Saywitz. Goodman. Nicholas. & 
Moan. mi). 

For example. Saywitz and her colleagues 
(Saywitz etal.. 1991 ) examined 5- and 7-year-old 
girls' memories of a medical examination. Half of 
each age group had a scoliosis exam (for curvature 
of the spine), and the other children had a genital 
exam. When children were interviewed either I or 
4 weeks later, they were asked suggestive and 
nonsuggestive questions that w>ere abuse-related 
(e.g.. "How many times did the doctor kiss you?") 
or nonabuse-related (e.g.. “Didn't the doctor look 
at your feet first?"). Although the older children 
were initially more accurate than the younger 
children on most questions, some of these age 
differences disappeared after the 4-week delay. 
Most importantly, although there were age differ- 
ences in response to the suggestive abuse ques- 
tions, very few children of either age gave incor- 
rect responses: the 7-year-old children never made 
a false report of abuse, and the 5-year-olds did so 
only 4 times, although they were given 2 1 5 oppor- 
tunities. 

Saywitz and her col leagues point out specific 
patterns of results in this study. They conclude the 
children’s inaccurate reports involved mainly er- 
rors of omission rather than commission. The 
majority of children in the genital examination 
condition did not disclose genital contact unless 
specifically asked todoso. This latter opportunity 
was only provided with the direct (leading) ques- 
tion format (“Did the doctor touch you here?"). In 
the scoliosis condition, when children were asked 
these direct quest ions. 2.S(i9( of the chi Idren falsely 
affirmed vaginal touch and 5.56 f -f falsely affirmed 
anal touch? In reviewing this study. Goodman 
and C'larke-Stewart ( 1991 ) conclude that: 

. . . obtaining accurate testimony 
about sexual abuse from young 
children is a complex task. Part of 
the complexity rests in the fact that 



there are dangers as well as ben- 
efits in the use of leading questions 
with children . The benefits appear 
in the finding . . . that leading 
questions were often neeessary to 
elieit information from children 
about actual events they had expe- 
rienced (genital touching). . . . The 
children. . . . were generally accu- 
rate in reporting specific and per- 
sonal things that had happened to 
them. If these results can be gen- 
eralized to investigations of abuse, 
they suggest that normal children 
are unlikely to make up details of 
sexual acts when nothing abusive 
happened. They suggest that chil- 
dren will not easily yield to an 
interviewer's suggestion that some- 
thing sexual occurred w hen in tact 
it did not. especially if nonin- 
timidating interviewers ask ques- 
tionschildren can comprehend, (pp. 
102-103) 

Thus, according to this group of researchers, 
earlier studies of children's suggestibility may 
have overestimated the extent to which they are 
suggestible. For example: 

There is now no real question that 
the law and many developmental- 
ists were wrong in their assump- 
tion that children are highly vul- 
nerable to suggestion, at least in 
regard to salient details. Although 
some developmentalists may be 
challenged to find developmental 
differences in suggestibility in in- 
creasingly arcane circumstances, 
as a practical matter who really 
cares whether 3-year-old children 
are less suggestible about periph- 



eral details in events that they wit- 
nessed than are 4-year-old chil- 
dren' Perhaps the question has 
some significance for developmen- 
tal theory, but surely it has little or 
no meaning for policy and practice 
in child protection and law. 

(Melton. 1992. p. 15?'? 

It is important, however, to point out that not 
all data on children's reports of medical proce- 
dures are consistent with these conclusions. 
Ornstein and his colleagues (Baker-Ward, Gor- 
don. Ornstein, Larus. & Clubb, in press; Ornstein. 
Gordon. & Larus. 1 992) found that when children 
were later questioned about their memories of the 
visit to the pediatrician. 3-year-olds were more 
prone than 6-year-olds to make false claims in 
response to suggestive questions about silly events 
involving body contact (e.g.. "Did the nurse lick 
your knee?"). Oates and Shrimpton (1991) also 
found that preschoolers were more suggestible 
than older children about previously experienced 
events that involved body touching. In contrast to 
the Say wit/, et al. findings that false reports in 
response to suggestive questions are relatively 
infrequent, the younger children in these latter 
studies provided a substantial number of false 
reports in response to suggestive questions. Until 
recently, however, only a few studies have in- 
cluded explicit questions about sexual touching. 
Recent research by us and our colleagues has 
\ ielded different results, which will be reported in 
greater detail later when we describe our study of 
a pediatric examination. 

Increasing the dynamics of the interview. A 
second major innovative theme in the current 
research on children's suggestibility involves ex- 
amining the effects of various interviewing tech- 
niques on children's reports. This locus has arisen 
in response to the concern that the interviewing 
procedures of earlier studies were less intense than 




those that bring children to court — so much so as 
to result in a potential of children's 

suggestibility (Raskin & Esplin. 1991; Steller. 
1991 ). 

The interviewing procedures used in tradi- 
tional laboratory studies and those used in the 
forensic arena differ in several ways. First, it is 
frequently the case that children who come to 
court are questioned weeks, months, or even years 
after the occurrence of an event, as opposed to 
minutes or days later. Suggestibility effects may 
be more salient after long delays, because the 
original memory trace has faded sufficiently to 
allow the suggestion to intrude more readily than 
might occur after shorter delays. 

Second, child witnesses are rarely inter- 
viewed only one time, by one interviewer, or 
under nonstressful conditions. The modal child 
witness has been interviewed between 4 and I I 
times prior to the first courtroom appearance; 
sometimes children are interviewed weekly for 
years about the same event — in therapy sessions, 
for instance. Leichtman and Ceci (1993) have 
suggested that the incessant use of leading ques- 
tions and suggest ions in these repeated interviews 
may result in a qualitatively different type of 
report distortion than that which arises from a 
single misleading question in a single postevent 
interview. 

Third, an examination of the interviews of 
some child witnesses reveals that the label "sug- 
gestive interview" may describe more than the use 
of misleading questions. Rather, implicit and ex- 
plicit suggestions can be woven into the fabric of 
the interview through the use of bribes, threats, 
repetitions of certain questions, and the induction 
of stereotypes and expectancies (Ceci ct Brack. 
199.3). 

Finally, the questioning of child witnesses is 
typically conducted by parents, therapists, and 



legal officials, all of whom represent status and 
power in the eyes of the child; children may thus 
be more likely to comply with the suggestions of 
these interviewers than with those of the neutral 
interviewers employed in most research studies. 

Although it is very difficult to create experi- 
mental conditions that simulate the confluence of 
the conditions present in child witness interviews 
(stressful episodes, with repeated and suggestive 
questioning over prolonged periods of time), re- 
searchers are beginning to examine how children's 
reports are influenced by the repetition of sugges- 
tions in multiple interviews prior to and following 
the occurrence of an event. In addition, research- 
ers have focused on the interviewer and the poten- 
tial ef fects that a particular interviewer’s bias may 
have on the reports elicited from young children. 
We confine our discussion here to three studies 
recently carried out w-ith ourcolleagues at Cornell 
and McGill universities, as they were designed 
specifically to address these issues (see Ceci & 
Brack (I993|. for discussion of additional stud- 
ies). In focusing on our own studies, we necessar- 
ily present a particularized view, inspired by our 
own hypotheses, assumptions, and values, but the 
studies are designed to build on and challenge 
extant research. 

In these studies, we patterned ourexperimen- 
tal manipulations after materials collected over 
the past decade from court transcripts and from 
therapy sessions and law enforcement interview's 
involvingchildren incases similartothe McMartin 
case where there was a strong suspicion of abuse 
(see transcripts in Ceci. in press; Ceci & Brack. 
1993). These materials reveal that a child's first 
"disclosure" about abuse commonly occurs when 
an interviewer pursues a single hypothesis about 
the basis of the child’s difficulties, which entails 
leading and suggestive interviews, often with fan- 
tasy inductions and ‘’self-empowerment" tech- 
niques the techniques themselves being poten- 



tially suggestive and stereotype inducing. Such 
disclosures are then pursued in law enforcement, 
child protective service, or therapeutic interviews. 

Study 1 : The effect of interviewer bias 

on children's reports 

Ideally a forensic interview should be guided 
by a hypothesis-testing framework. Just as scien- 
tists try to arrive at the truth by ruling out rival 
hypotheses or by falsifying a favored hypothesis 
(Ceci & Bronfenbrenner. 1991; Dawes. 1992; 
Popper. 1962). interviewers should, in similar 
manner, attempt to rule out rival hypotheses, rather 
than exclusively attempting to confirm their fa- 
vored one. However, because of situational pres- 
sures (e.g.. case workers must sometimes make 
immediate determinations of potential danger to a 
child), it is not feasible that interviewers generate 
and test every conceivable hypothesis or. con- 
versely. that they be ‘blind" to obviously relevant 
information pertaining to a main hypothesis that 
abuse is indeed present, failure to recognize rel- 
evant information provided by the child could 
result in crucial missed opportunities." But. as the 
following study shows, failure to test a rival hy- 
pothesis can result in reporting errors. 

In this study (Ceci, l.eichtman. & White, in 
press), we examined how an interviewer's hy- 
pothesis can influence the accuracy of young 
children's reports, Preschoolers were exposed to a 
game-like event and then interviewed 1 month 
later. The interv iewer was given some informa- 
tion about events that mil’ll! have occurred; some 
of the information was accurate and some of it was 
inaccurate. The interv ievv er was told to interview 
each child and to use whatever strategies she felt 
necessarv to elicit the most factually accurate 
report from the child. The information we pro- 
vided influenced the interviewer's hypotheses 
about what had transpired in this game, which, in 
turn, appeared to exercise a povv erlul inf lucncc on 



the dynamics of the interview, with the inter- 
viewer eventually shaping some of the children's 
reports to be consistent with her hypothesis about 
what had happened. When the interviewer was 
accurately informed, she got children to recall 
correctly 93# of the events that had transpired. It 
is important to note that the children made no false 
accusations when the interviewer was correctly 
informed, that is. they only made "errors of omis- 
sion.'' However, when the interviewer was misin- 
formed. 34# of the 3- to 4-year-olds and 18# of 
the 5- to 6-year-olds corroborated one or more 
false events that the interview'er erroneously be- 
lieved had transpired. Thus, in the misinformed 
condition, the children made “errors of commis- 
sion." Finally, the children seemingly became 
more credible as the interview unfolded. Many 
children initially stated details inconsistently, or 
with reluctance or even denial, but as the inter- 
viewer persisted in asking about nonevents, some 
abandoned their hesitancy and denials. 

Because the interviewers were trained pro- 
fessionals (one was an experienced social worker, 
the other a nursery school teacher), we feel that the 
tvpes of interactions observed in this study may be 
similar to those that occur in interviews between 
voting children and parents, teachers, and profes- 
sionals who are not given explicit training in how 
to generate and test alternative hypotheses. Our 
review of the materials from some publicized 
cases, such as McMartin. reveals that professional 
interviewers often steadfastly stiek with one line 
of inquiry even when children continue to deny 
that the questioned events ever occurred (for ex- 
amples see Ceci, in press). 

Study 2; The effects of stereotype 
induction and repeated suggestions on 
voting children's reports 

A stranger named Sam Stone paid a 2-minute 
v isit to preschoolers (aged 3 to 6 ) in their day-care 



center (Lcichtman & Ccci. in press). Following 
Sam Stone's visit, the children were asked for 
details about the visit on four different occasions 
over a 10-week period. On each occasion, the 
interviewer refrained from using suggestive ques- 
tions; she simply encouraged children to describe 
Sam Stone's visit in as much detail as possible. 
One month later, the children were interviewed a 
fifth time by a new interviewer, who first elicited 
a free narrative about the visit. Then, using probes, 
she asked about two “nonevents" which involved 
Sam Stone doing something to a teddy bear and a 
book. In reality, he never touched either item. 

When asked in the fifth interview. “Did Sam 
Stone do anything to a book ora teddy bear?" most 
children accurately replied. “No." Only \Q c /< of 
the youngest (3- to 4-year-old) children's answers 
contained claims that Sam Stone did anything to a 
book or teddy bear. When asked if they ac. ally 
saw him do anything to the book or teddy bear, as 
opposed to “thinking they saw him do some- 
thing.'' or “hearing he did something." now only 
5 ( '< ol their answers contained claims that any- 
thing occurred. Finally, when these 5 r/ < were gen- 
tly challenged (“You didn't really see him do 
anything to the book/the teddy bear, did you?") 
only 2.5 r /( still insisted on the reality of the fic- 
tional event. None of the older (5- to 6-year-old ) 
chi klren reported that they had seen Sam Stone do 
either of the fictional actions. 

Another group of preschoolers was presented 
with a stereotype of Sam Stone before he ever 
visited their school. We did this to mimic the sort 
of stereotypes that some child witnesses have 
acquired about actual defendants. (In actual cases, 
for example, some children have been told repeat- 
edly that the defendant did “bad things.") Kach 
week, beginning a month prior to the v isit, the 
children in our study were told a new Sam Stone 
story in which he was depicted as very clumsy. For 
example: 



You'll neverguess who visited me 
last night. ( pause | That's right. Sam 
Stone! And guess what he did this 
time? He asked to borrow mv 
Barbie and when he was carrying 
her down the stairs, he tripped and 
fell and broke her arm. That Sam 
Stone is always getting into acci- 
dents and breaking things! 

Following Sam Stone's visit, these children 
were interviewed four times over a 10-week pe- 
riod. These four interviews contained erroneous 
suggestions (e.g.. “When Sam Stone ripped that 
book, was he being silly or was he angry?”). At the 
filth interview, these children were asked for a 
free narrative about Sam's visit and were then 
asked probing questions about the two nonevents. 

In this last interview. 72 f 7r of the youngest 
preschoolers claimed that Sam Stone did one or 
both misdeeds, a figure that dropped to 44 ( /< when 
they were asked if they actually saw him do these 
things. Importantly. 2\ r /< continued to insist that 
they saw him do these things, even when gentlv 
challenged. The older preschoolers, though more 
accurate, included I l c i of children who insisted 
they saw him do the misdeeds. 

Some researchers have opined that the pres- 
ence of perceptual details in reports is one of the 
indicators of an actual memory, as opposed to a 
confabulated one (Raskin & Yuille. 1989; 
Schooler. Gerhard. & Loftus. 1 986). In this study, 
how ever, the presence of perceptual details was no 
assurance that the report was accurate. In fact, 
children in the stereotype plus suggestion condi- 
tion produced a surprising number of fabricated 
perceptual details to embellish their false accounts 
of nonevents (e.g.. claiming that Sam Stone took 
the teddy bear into a bathroom and soaked it in hot 
water before smearing it with a crayon). The 
difference in the quality of reports obtained in this 
studx compared to others in the suggestihilitx 



literature may reflect the conditions under which 
the reports were obtained. As mentioned earlier, in 
most past studies, children's erroneous reports 
were in response to a single misleading question, 
posed after a brief delay following the event in 
question. In contrast, in the present study, children's 
false reports were a produet of repeated erroneous 
suggestions over a relatively long period of time, 
coupled with a stereotype that was consistent with 
these suggestions. 

it is one thing to demonstrate that children 
can be induced to make errors and include percep- 
tual details in their reports, but it is another matter 
to show that their faulty reports are convincing to 
others. To examine the belie vahi I ity of the 
children's reports, we showed videotapes of their 
final interview to approximately 1.000 research- 
ers and clinicians who work on children's testimo- 
nial issues. These researchers and clinicians were 
told that all the children observed Sam Stone's 
visit to their day-care centers. They were asked to 
decide which of the events reported by the chil- 
dren actually transpired and then to rate the overall 
credibility of each child. 

The majority of the professionals were inac- 
curate. Analyses indicated that these experts — 
who conduct research on the credibility ol 
children's reports, provide therapy to children 
suspected of having been abused, or carry out law 
enforcement interviews with children — generally 
failed to detect which of the children's claims 
were accurate, despite being confident in their 
judgments. Since so many ol the children claimed 
that Sam Stone ripped the book and/or soiled the 
bear, it is understandable that many of the experts 
reasoned that these events must have transpired. 
But their overall credibility ratings of individual 
children were also highly inaccurate, with the very 
children who were least accurate being rated as 
most accurate. We believe that the highly credible 
vet inaccurate reports obtained from the children 



resulted from a combination of repeated inter- 
views with persistent and intense suggestions that 
built on a set of prior expectations (i.e.. a stereo- 
type). In a similar way. it may become difficult to 
separate credibility from accuracy when children, 
after repeated interviews, give a formal video- 
taped interview or testify in court. 

Study 3: Influencing children's reports 

of a pediatric v isit 

It could be argued that the Sam Stone Study 
is not relevant to evaluating the reliability of a 
child witness who reports personally experienced 
events involving his or her own body, especially 
when the experience involves some degree of 
distress. Furthermore, some might argue that the 
Sam Stone data are not germane to testimony 
about highly predictable and scripted events. In 
cases where the event involves a child's own 
body, is somewhat stressful, and is predictable, it 
is often thought that children may be less prone to 
suggestion. 

To determine if children could be misled 
under such circumstances, we examined the influ- 
ence of postevent suggestions on children's re- 
ports about a pediatric visit where they were 
examined (C'eci. Leichtman. &. Bruck. in press). 
The study had two phases. In the first phase. 5- 
year-old children visited their pediatrician for an 
annual check-up. A male pediatrician examined 
the child. Then the child met a female research 
assistant who talked about a poster that was hang- 
ing on the wall in the examining room. Next, the 
pediatrician gave the child an oral polio vaccine 
and a DPT inoculation. Then the research assistant 
gave the child one of three types of feedback about 
how the child had acted when receiving the inocu- 
lation. One group was given pain-affirming feed- 
back; they were told that it seemed as though the 
shot really hurt them, hut shots hurt even big kids 



(hurt condition). A second group was given pain- 
denying information; these children were told that 
they acted like the shot did not hurt much, and that 
they were really brave (no-hurt condition). Fi- 
nally. a third group was merely told that the shot 
was over (neutral condition). After the feedback, 
the research assistant gave each child a treat and 
then read the child a story. One week later, a 
different assistant \ i sited the children and asked 
each one to indicate through the use of various 
rating scales how much he or she had cried during 
the shot and how much the shot hurt. 

The children's reports did not differ as a 
function of feedback condition. Thus, we found 
that children could not be influenced to make 
inaccurate reports concerning significant and 
stressful procedures involving their own bodies. 
These results are similar in spirit to those of 
Saywitz et al. ( 1991 ) who also provided children 
with suggestions about stressful, personally expe- 
rienced events in a single interview and discov- 
ered that children can be quite resistant to errone- 
ous suggestions. 

In the second phase of our study, we 
reinterviewed the children three more times, ap- 
proximately I year after the shot. During these 
interviews, children were provided with repeated 
suggestions about how they had acted when they 
received their inoculations. Thus, as in the fir t 
phase of the study, some children were told that 
they were brave when they got their shot, whereas 
other children were not given any feedback. (For 
ethical reasons, we provided only "no-hurt" and 
"neutral" feedback in this phase of the study. We 
felt that providing "hurt' feedback might induce 
false or unpleasant memories about visiting the 
doctor.) When the children were visited for a 
fourth time and asked to rate how much the shot 
had hurt and how much they had cried, there were 
large suggestibility e fleets. Those who had been 
repeatedly told that they had acted brave when 



they had received their inoculation a year earlier 
reported significantly less crying and less hurt 
than children who were given no feedback. Thus, 
these data indicate that children's reports of stress- 
ful events involving their own bodies can be 
distorted under certain circumstances. 

In the second phase of this study, we also 
tried to mislead children about the people who 
performed various actions during the original in- 
oculation visit. Some children were falsely re- 
minded on three occasions that the pediatrician 
gave them treats, showed them the poster, and read 
them a story. Some children were falsely re- 
minded on three occasions that the research assis- 
tant gave them the inoculation and the oral vac- 
cine. Control children were merely reminded that 
"someone" did these things. Based on the conclu- 
sions of other researchers (e.g.. Fivush. 1993; 
Melton. 1992). it was hypothesized that children 
should not be suggestible about such important 
events and that they should be particularly im- 
mune to suggestions that incorporate shifts of 
gender. The male pediatrician had never given 
them treats or read them a story, and the female 
research assistant had never performed any medi- 
cal procedures. 

Contrary to these predictions, the children 
were misled. In the fourth interview, when asked 
about theirdoctor's visit in the previous year. 67% 
of the children (versus 27% of the control chil- 
dren) who were given misleading information 
about the pediatrician reported that the pediatri- 
cian showed them the poster, gave them treat: . or 
read them a story. For children who were falsely 
told that the research assistant had given them the 
shot and the vaccine. 50% (versus 16% of the 
control children) fell sway to at least one of these 
two suggestions. Interestingly. 38% of the chil- 
dren who were given misleading information that 
the research assistant cave them the oral vaccine 
and the inoculation also said that the research 
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assistant had performed other scripted events that 
not only had never occurred blit also had never 
been suggested (e.g.. reporting that the research 
assistant checked their ears and nose). None of the 
control children made such inaccurate reports. 
Thus, our suggestions influenced not only 
children's reports of personally experienced, sa- 
lient events, but also their reports for nonsuggested 
scripted events that were related to the suggested 
events. 

These data indicate that under certain cir- 
cumstances children's reports concerning stress- 
ful events involving their own bodies can be 
influenced. The two factors that were most critical 
to this pattern of results were the intensity of the 
suggestions (i.e.. repeating the suggestions over 
multiple interviews) and their timing (i.e.. the long 
delay between the original event and interview 
about the event). These same two factors are 
characteristic of the conditions under which chil- 
dren made allegations of sexual abuse in many of 
the cases described at the beginning of this report. 

The results of this study are consistent with 
the Sam Stone study even though the nature of the 
e\etr about which children were misled were 
different. In the Sam Stone study, repeated sug- 
gestions and stereotypes led to convincing fabri- 
cations of nonoccurring events. In the pediatrician 
study, misleading information given in repeated 
interviews after a long delay following a target 
event influenced children's reports of personally- 
experienced. salient events. 

V7/o suiixcslihilitx of anatomically correct 
dolls. Anatomically correct dolls arc frequently 
used by professionals, including child therapists, 
police, child protection workers, and attorney s. in 
interviewing children about suspected sexual 
abuse. According to recent surveys. 90'* of field 
professionals use anatomical dolls at least occa- 
sionally in their investigative interviews vv ith chil- 
dren suspected of having been sexually abused 



(Boat & Everson, 1 988; Conte. Sorenson. Fogarty. 

& Rosa, 1991). Although no national figures are 
available, it appears that expert testimony is often 
based on observations of children's interactions 
with such dolls (Mason, 1991). We include a 
discussion here of anatomical dolls, because a 
number of commentators have raised questions 
about whether the dolls are suggestive (e.g.. 
McGough. in press; Moss. 1988; Raskin & Yuille. 
1989). 

One rationale for the use of anatomical dolls 
is that they allow children to manipulate objects 
reminiscent of asexual event, thereby cuing recall 
and overcoming language and memory problems. 
Another rationale is that their use is thought to 
overcome embarrassment and shyness. The dolls 
have also been used as projective test . Some 
claim that if a child actively avoids these dolls, 
shows distress if they are undressed, or show's 
unusual preoccupation with their genitalia, this is 
consistent with the hypothesis that the child has 
been abused (see Mason. 1991 ). 

The use of anatomically correct dolls has 
raised skepticism, however, among researchers 
and professionals alike. Two related arguments 
are frequently invoked against their use. The first 
is that the dolls are suggestive, that they encourage 
the child to engage in sexual play even if the child 
has not been sexually abused (e.g.. Gardner. 1989; 
Terr. 1988). A child may insert a finger into a 
doll's genitalia, for instance, simply because of its 
novelty or "affordance." much the way a child 
may insert a finger inti) the hole in a doughnut. 
Another criticism is that it is impossible to make 
firm judgments about children's abuse status on 
the basis of their doll play because there are no 
normative data on nonabused chi Idren’s doll play . 

In several studies, researchers have com- 
pared the doll play of sexually abused and 
nonabused children. In addition, there have been a 
score of studies examining the doll play of 



nonabused children. Reviews of this literature 
(Berry & Skinner. 1993; Ceci & Bruek. 1993; 
Wolfner. Faust. & Dawes, 1993) indicate that 
many of the studies are methodologically inad- 
equate and do not allow for firm interpretations 
about the potential usefulness or risks of using 
dolls. Furthermore, some data indicate that some 
of the play patterns thought to be characteristic of 
abused children, such as playing with the dolls in 
a suggestive orexplieit sexual manner, or showing 
reticence or avoidance when presented with the 
dolls, also occur in samples of nonabused children 
(see Bruek & Ceci 1 1 99.3 1 for a review). Finally, 
other data indicate that the dolls, though not sug- 
gestive. do not improve reporting — particularly 
amongyounger children (e.g.. Goodman & Aman. 
1990). 

We have recently completed a study of 3- 
v ear-old children's interactions with anatomically 
correct dolls that highlights each of these results 
( Bruek. Ceci. Franca'ur, & Renick, in press). The 
children in this study visited their pediatrician for 
theirannual check-up. The pediatrician conducted 
genital examinations with half the children; the 
remaining children did not receive genital exams. 
Immediately after the examination, the child was 
interviewed by a research assistant. Pointing to the 
buttocks and then to the genital areas of an ana- 
tomically correct doll, the assistant asked each 
child. "Did the doctor touch you here?” Later in 
the interview, the child was asked to use the doll to 
show how the doctor had touched his or her 
buttocks and genitals. 

Children were quite inaccurate across all 
conditions. Only 459{ of the children who re- 
ceived genital examinations correctly answered 
"Yes" to the questions “Did the doctor touch you 
here [on buttocks or genitals]?" Only 50' r of the 
children who did not receive genital exams cor- 
rectly replied "No" to these questions. Further, the 
children's accuracy did not improve when they 



were given the dolls and asked to show how the 
doctor had touched them. Only 2 5 ‘/< of the chil- 
dren who had received genital examinations cor- 
rectly showed how the pediatrician had touched 
their genitals and buttocks. (A significant number 
of female subjects in this condition were inaccu- 
rate. because they inserted their fingers into the 
anal or genital cavities of the dolls — which the 
pediatrician never did.) Only 45 9£ of the children 
who did not receive genital examinations were 
accurate in not showing any touching; that is. 55 f # 
of the children who did not receive genital exami- 
nations falsely showed either genital oranal touch- 
ing when given the dolls, a pattern most prevalent 
among the females in this group; 15 c /< of the 
females whodid not receive a genital examination 
falsely showed that the pediatrician touched their 
genitals or their buttocks. 

With the data on the potential usefulness of 
dolls equivocal at best, we feel that an important 
confound in the literature deserves mention: the 
context for the presentation of the dolls in these 
research settings is very different from that of 
actual forensic and clinical settings. Transcripts of 
therapy sessions with children suspected of hav- 
ing been sexually abused reveal interviewers em- 
ploying various practices: naming the dolls after 
defendants: berating the dolls for alleged abuses 
against the child (e.g.. shaking a finger at the male 
doll who has been named after the defendant and 
yelling. "You are naughty for hurting Jennifer!"): 
assuming the role of fantasy characters in doll 
play: and creating a persistent atmosphere of accu- 
sation. In the research settings in which the use of 
anatomical dolls has been studied, nonabused 
children were never subjected to such highly sug- 
gestive experiences prior to being interviewed 
with the dolls: they were not given prior motiva- 
tion to play with the dolls suggestively or aggres- 
sively. On the other hand, children who were 
alleged to have been abused were sometimes ex- 
posed to the dolls repeatedly prior to coming to the 
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research setting; perhaps these interviews had 
involved repeated suggestions from parents and 
interviewers about various sexual themes. That 
their play with the dolls differed from that of 
nonabused children w ho lacked this prior experi- 
ence could be attributed to the abused children's 
prior therapeutic or investigatory experiences, 
rather than to any inherent way in which they 
might be expected to play w ith the dolls. 

Unfortunately, no study has examined the 
suggestive attributes of anatomical dolls, control- 
ling for the preexperimental experience as a po- 
tentially serious confound. We simply do not 
know how nonabused children would behave 
with the dolls were they to have suggestive expe- 
riences prior to the experimental interview^ Con- 
versely. we also do not know how abused children 
play with the dolls in their first investigatory 
interview, since the children in these studies have 
often been interviewed more than once and some 
have been exposed to the dolls at least once, prior 
to the experimental interview. 

On the basis of our literature review (Ceci & 
Bruck. 1993). w e concluded that the inconsistent 
findings point to the need for additional research 
and to the need for the development of explicit 
procedures to govern the use of anatomically 
correct dolls by interviewers. Until such research 
is available, the dolls ought to be used with great 
caution. Recently. Berry. & Skinner ( 1993) and 
Wolfnerand his colleagues ( 1993) were even less 
supportive of doll use: 

. . . we are left w ith the conclusion 
that there is simply no scientific 
evidence available that would jus- 
tify clinical or forensic diagnosis 
of abuse on the basis of the dolls. 

The common counter is that such 
play is "just one component" in 
reaching such a diagnosis based on 
a "full clinical” picture. . . . |Ooll| 



play cannot be validly used as a 
component, however, unless it pro- 
vides incremental validity, and 
there is virtually no evidence that it 
does." (Wolfneret ah. p. 9) 

Summary of current literature. The studies 
reviewed here highlight the different paradigms 
that researchers are now employing to examine 
children's suggestibility. In our review of this 
literature (Ceci & Bruck. 1993). we found that 
results of the most recent studies, in contrast to 
older ones, are somewhat more contradictory about 
the reliability of children's reports. One can locate 
studies claiming that young children are as im- 
mune to suggestion as older children (e.g.. Marin. 
Holmes. Guth. & Kovac. 1979: Say wit/., et ah. 
199 1 ). and studies claiming that younger children 
are more suggestible (Ceci. Ross. & Toglia. 19X7: 
Cohen & Hamick. 1980; King & Yuille. 1987). 
Such mixed results have led to a confusing juxta- 
position of headlines: "Study shows children are 
credible as witnesses." Or. "Research shows child 
witnesses unable to distinguish reality from fan- 
tasy ." 

A careful reading of the literature suggests, 
howev er, that there are reliable age differences in 
suggestibility, with preschoolers' reports more 
influenced by erroneous suggestions than older 
children's. In our review of the suggestibility 
literature, we found 18 studies that compared 
preschoolers to older children or to adults; in 13 
of 18 of these studies, suggestibility was greater 
among preschoolers than older children or adults 
(see Table 2 in Ceci it Bruck. 1993). To be sure, 
some researchers attach various caveats to this 
conclusion. For example, some have claimed that 
age differences in suggestibility are evident mainly 
for nonparticipant children, i.e.. bystanders ( Rudy 
& Goodman. 1991 );and for peripheral, nonsalient 
events ( F'ivush. 1 993 ). And some researchers find 
that although young children may make some 
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errors in response to suggestive questions with a 
sexual theme, on the whole they are highly resis- 
tant to such questions (e.g.. Say wit/ et al.. 1991: 
Goodman et al., 1991). Still others have found 
larger age differences in suggestibility for ques- 
tions with sexual themes (e.g., Baker-Ward et al., 
in press: Goodman, Rudy, Bottoms. & Aman. 
1990) and for questions about salient events (e.g., 
Cassel & Bjorklund. 1993). 

Although preschoolers are usually depicted 
as being the most suggestible, it is important to 
point out that older children and adults are also 
suggestible. For example, as described above. 7- 
year-olds* reports, after I year, of their visits to the 
pediatrician could be quite easily altered through 
suggestion. Clarke-Stewart. Thompson. & Lepore 
(reported in Goodman & Clarke-Stewart. 1991 ) 
also found that 7-year-old children's reports and 
interpretations of a recently experienced event 
could be easily manipulated through suggestion. 
Also, Goodman. Wilson. Ha/.an. & Reed (1989) 
found that a substantial number of 7- to l()-year- 
old children incorrectly agreed with interviewers' 
suggestions about details of an event that occurred 
4 years earlier. Many of these misleading sugges- 
tions had sexual themes. Finally, suggestions can 
alter some fundamental aspects of adults' autobio- 
graphical memories ( Loft us 1 993 ). Thus, we can- 
not conclude that older children and adults are not 
suggestible, only that their level of suggestibility 
is less than that of preschoolers. 

We reiterate, however, that the conditions 
created in these studies differ markedly from those 
that occur in actual therapy or in law enforcement 
investigations: these latter two contexts are sel- 
dom as sanitized of affect and free of motives as 
those in the research setting. The real life situation 
may entail high levels of stress, assaults to the 
child's body, and loss of control. In some cases, 
children are interviewed and reinterviewed under 
emotionally charged circumstances, entailing the 



use of bribes and threats, and often in the presence 
of highly distressed parents; under such condi- 
tions some children may finally utter reports that 
are simply consistent with the interviewer's ex- 
pectations. In the McMartin case, interviewers 
were alleged to have coerced children's state- 
ments by praising them when they reported events 
that were consistent with the interviewer's beliefs 
and criticizing them for failing to do so (e.g.. 
calling them "dumb"). Interviewers in both this 
case and other day-care cases also told children 
that other children had already disclosed the de- 
tails of the abuse, thus creating added pressure to 
assent to suggestions of abuse. Not surprisingly, 
interviewers in the McMartin case managed to 
elicit statements of abuse from 369 of nearly 400 
children they interviewed (Sauer, 1993). although 
only one child had made claims of abuse prior to 
the interviews. (This girl's accusations were so 
bizarre that the prosecution dropped them from 
the case | Sauer. 1 993 1 ). 

Hlsewhere we and others have used more 
emotionally laden events to examine issues re- 
lated to the role of affect and bodily touching in 
producing misinformation effects, including sug- 
gestions about being kissed while naked, witness- 
ing parents violate norms, or hurting others to 
protect loved ones (see Ceci. Leichtman. Putnick. 
& Nightingale. 1993). and experiencing painful 
and/or embarrassing medical procedures (e.g.. 
Goodman. 1993: Ornstein, Baker-Ward. Gordon. 
& Merritt, 1993). Although children's resistance 
to suggestions are sensitive to all of these factors 
( and others ), no study has attempted to incorporate 
all of them into a single experiment. 

It is highly unlikely, however, that we will 
ever mimic the assaultive nature of some acts or 
interviews perpetrated on child victims and wit- 
nesses. Thus we are far from being able to provide 
a definitive conclusion about the reliability of all 
child witnesses' reports. It is safe to conclude. 
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though, that past pronouncements hy some rather 
extreme advocates on both sides of the bench are 
simply unfounded. Children are neither as hyper- 
suggestible and eoachable as some prodefense 
advocates have alleged, nor as resistant to sugges- 
tions about their own bodies as some proprosecu- 
tion advocates have claimed. They can be led. 
under certain conditions, to incorporate false sug- 
gestions into their accounts of even intimate bodily 
touching, but they can also be amazingly resistant 
to false suggestions and able to provide highly 
detailed and accurate reports of events that tran- 
spired weeks or months ago (e.g.. Baker-Ward et 
al.. in press). This mix of suggestibility and resis- 
tance to suggestion underscores the need tor great 
caution in accepting the claims of those who 
would put either a prodefense or proprosecution 
''spin'" on the data. 

Policy Implications 

Expert Witnesses 

When a child comes to court to testify, this is 
often because he or she is the sole witness to a 
crime: this is particularly likely to be the situation 
in sexual abuse cases where the child is not only 
the sole witness, hut there may be no physical 
evidence of abuse. The problem of uncorrobo- 
rated testimony is compounded by the fact that the 
testimony of children may at times seem to lack 
credibility. As a result, both the prosecution and 
defense may call physicians, mental health profes- 
sionals, and social scientists to servo as expert 
witnesses. In this section, we discuss the t|uali I ica- 
tions and roles of mental health professionals and 
social scientists who serve as expert witnesses in 
cases in\ olving child witnesses, particularly in 
cases of alleged sexual abuse. 

According to legal views (see Mason. 1W1 : 
Myers, l ( W). these expert witnesses can be clas- 
sified into two categories. ( I ) The first type, usu- 



ally a mental health professional, is asked either to 
provide a generic description of the behavioral 
symptoms associated with sexual abuse, or to 
provide an opinion as to whether or not a particular 
child was abused. In the latter instance, the expert 
may have formulated his or her opinion based on 
therapy with the chi Id or an assessment ot a child' s 
behavior; in some cases, the expert witness may 
have had no contact with the child in question. 
This type of expert is also frequently called upon 
to rehabilitate the credibility of a child witness 
who has been attacked by the defense, e.g.. for 
delayed reporting. In this situation, the expert 
witness explains that, though such behaviors are 
not themselves diagnostic of abuse, it is not un- 
usual for abused children to display a range of 
behaviors, such as recantation, delay of reporting, 
and inconsistent reporting. (2) The second type of 
expert witness is called to review the scientific 
literature on issues relevant to the credibility of 
child witnesses. This expert may cover various 
topics, including the literature on suggestibility as 
well as that on cognitive, emotional, and social 
development. 

Although one might conclude that the re- 
search oil children's suggestibility, discussed in 
this report, has policy implications for only the 
second type of witness, we argue that knowledge 
of this research is relevant to the professional 
qualifications and testimony of the first type of 
expert witness as well. The mental health profes- 
sional who testifies on the diagnosis of sexual 
abuse or who describes to a court the symptoms 
associated with sexual abuse must also take into 
consideration competing hypotheses that might 
explain why the child in question, or children in 
general, demonstrate particular symptoms or make 
allegations of sexual abuse. One of the alternative 
hypotheses to be considered is that the particular 
child's allegations or symptoms have resulted 
from suggestive influences of the sort described 
above. It is important for the expert to consider 
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such an alternative, because those same symptoms 
associated with sexual abuse (delayed reporting, 
retraction of the allegation, inconsistent accounts, 
inappropriate knowledge of sexual behavior, or 
unusual play with anatomically correct dolls) have 
been observed in nonabused children who have 
been exposed to suggestive influences (see Ber- 
liner & Conte. 1993; Kendall-Tackett. Williams. 
& Finkelhor. 1993, for recent reviews of the litera- 
ture). 

Diagnosing child sexual abuse is thus a com- 
plex task requiring experience with sexually abused 
children and knowledge of both the clinical and 
the suggestibility and developmental literature. 
Kxperts who testify on such matters should be 
well-versed in these domains (Myers. 1993). 

Some legal scholars have pointed out. how - 
ever. that little experience and training is required 
of expert witnesses who provide testimony to 
rehabilitate the child witness. Myers ( 1 993 ) writes 
that because this type of testimony is thought 
straightforward and simple, "a child protection 
services worker with six months on the job and 
knowledge of three or four pertinent articles is 
qualified to provide rehabilitative testimony on 
recantation and delayed reporting" (p. 177). We 
argue that this witness should have a more thor- 
ough knowledge of the scientific literature on both 
the indicators of child sexual abuse and the litera- 
ture on suggestibility. 

The second type of expert witness, those who 
testify about the scientific literature on suggest- 
ibility and child development, does not require 
clinical experience. However, this expert must 
have a thorough knowledge of the research litera- 
ture germane to his or her testimony. 

What the e xpert witness on children '.v sup- 
xestihilitx should tell the eourt. We come now to 
the question that has vexed any social scientist 
who e\er dreamed (or had nightmares) of being 



called upon to serve as an expert witness or to 
prepare an amicus brief for an appellate court on 
children's testimony, namely, what does our 
present state of scientific knowledge permit us to 
say about the reliability of the testimony of the 
child witness'? Having acknowledged the com- 
plexities of the research, we hold that expert 
witnesses, regardless of whether they are testify- 
ing for the prosecution or for the defense, should 
cover several points based on the literature; 

1 . There are reliable age effects in children's 
suggestibility, with nreschoolers being more vul- 
nerable than older cnildren to a variety of factors 
that contribute to unreliable reports. 

2. Although young children are often accu- 
rate reporters, some do make mistakes — particu- 
larly when they undergo suggestive interviews; 
and these errors can relate not only to peripheral 
details, but also to salient, predictable events that 
involve their own bodies. 

3. Measures can be taken to lessen the risk of 
suggestibility effects. To date, the factors that w e 
know most about concern the nature of the inter- 
view itself — its frequency, degree of suggestive- 
ness. and demand characteristics. 

• A child's report is less likely to be 
distorted, for example, after one in- 
terview than after several inter\ iews 
(the term "interviews" here includes 
informal conversations between par- 
ents and child about the target events ). 

• Interviewers who ask nonleading 
questions, who do not have a confir- 
matory bias (i.e.. an attachment to a 
single hypothesis), and who do not 
repeat close-ended, yes/no questions 
w ithin or across interviews, are more 
likely to obtain accurate reports from 
children. 
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• Interviewers who are patient, 
nonjudgmental. and who do not at- 
tempt to create demand characteris- 
tics (e.g.. by providing subtle rewards 
for certain responses) are likely to 
elicit the best quality reports from 
young children. 

Thus, at one extreme we can have more 
confidence in a child's spontaneous statements 
made prior to any attempt by an adult to elicit w hat 
they suspect may be the truth. At the other ex- 
treme. w e are more likely to be concerned when a 
child has made a statement after prolonged, re- 
peated, suggestive interviews. Unfortunately, most 
eases lie between these extremes and require a 
case -by-case analysis. 

4. Finally, it is also important that the court 
appreciate the complexity of the interrelationships 
of the factors affecting children's suggestibility. 
As in most areas of social science, eliects are 
rarely as straightforward as one might wish. I wen 
though suggestibility efleets may be robust, the 
effects are not universal. Results vary between 
studies, and children's behavior varies within stud- 
ies. Thus, even in studies with pronounced sug- 
gestibility effects, there are always some children 
who are highly resistant to suggestion. Some stud- 
ies may show reliable age differences in suggest- 
ibility even though the majority of both younger 
and older children did not succumb to suggestion. 
We have seen this in our own studies as well as in 
transcripts of forensic and therapeutic interview s: 
in some cases, no matter how much an interviewer 
may try to suggest that an event occurred, some 
children will consistently resist and not incorpo- 
rate the interviewer's suggestion or point of view. 
On the other side, although suggestibility effects 
tend to be most dramatic after prolonged and 
repealed interviewing, some children incorporate 
suggestions quickly, even after one short inter- 
view (e.g.. Clarke-Stcw art. et al.. 19X9, as re 



ported in Goodman & Clarke-Stewart. 1991 ). No 
facile conclusion can be presented to courts on this 
matter. 

Ideal vs. Actual Expert Witnesses. The 
"model" expert witness who comes forward to 
testify on issues related to children's suggestibil- 
ity should be someone who has thoroughly re- 
viewed the pertinent literature and who can present 
the relevant facts in a balanced manner to the triers 
of fact. This requirement is not an easy one to 
meet: this research area is developing rapidly and 
is riddled with a host of complex issues that 
necessitate a broad understanding of design, sta- 
tistics. and theory not likely possessed by some- 
one outside the research community. 

Unfortunately, many who serve as expert 
witnesses do not have this breadth ol knowledge. 
We have reviewed many examples of testimony 
by so-called experts that appeared to have been 
based on incomplete and at times dubious knowl- 
edge. Now here in their testimony is there any hint 
of the complexities that ought to have tamed the 
witness's statements to the jury. In the worst cases, 
the testimony was actually opposite to what we 
know to be the best evidence Irom systematic 
research. All too often such an expert appears in 
court strictly because his or her opinion is consis- 
tent with that of the defense or the prosecution, 
rather than because the witness is truly knowl- 
edgeable about the field. Such testimony can be a 
disservice to the aims of justice, not to mention to 
the professions these expert witnesses represent. 

Although the above discussion pertains 
mainly to the social scientist who testifies about 
children's suggestibility, our review of case mate- 
rial and the literature suggests that these same 
criticisms can be made of some experts who testify 
about the behavioral symptoms associated with 
sexual abuse. Mason (1991) analyzed 122 civil 
and criminal appellate court cases in which expert 
witnesses testified about child sexual abuse. She 
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found that experts frequently presented testimony 
that was either internally inconsistent or was con- 
tradicted by other experts. For example. 14 ex- 
perts cited age-inappropriate knowledge of sex 
and sexual preoccupations as characteristics of an 
abused child, whereas b experts asserted that na- 
ivete and aversion to sexual matters characterized 
the sexually abused child. Some experts main- 
tained that consistent accounts of events were 
important indicators of sexual abuse, whereas 
others maintained the opposite, that sexually 
abused children are characterized by their incon- 
sistent accounts. Mason also reported that appel- 
late courts tend to take expert testimony at face 
value: that they rarely raise questions about the 
testimony's acceptance by the scientific commu- 
nity. or about the credentials of the mental health 
professional presenting expert testimony. 

The response of the courts in this study rc- 
\eals that there is a critical gulf between the 
scientific community and the judiciary. J udges are 
not willing and probably not able to critically 
evaluate the reliability of the testimony offered 
( Mason. I 99 1 . p. 205 ). 

An example of the unscientific nature of 
some experts' testimony is illustrated by Kelly 
Michaels v. State of New Jersey. Michaels was a 
preschool teacher convicted on I 15 counts of 
sexual offenses involving 20 children, and sen- 
# tcnced to 47 years in prison. The expert witness for 
the prosecution testified that conduct of all but one 
of the child w itnesses was consistent with having 
been sexually abused. She did not seriously con- 
sider the possible effects of numerous suggestive 
influences on the children's testimony . which had 
included persistent, aggressive, and suggestive 
iutcrv iew s w ith children who initially denied that 
anything had happened. In this case, the appellate 
court did challenge the testimony of the expert 
witness, reversing Michael's com ict ion (after she 
had spent 5 years in prison), in part because the 
expert's testimony concerning the child bcha\ 



ioral indicators of abuse did not have acceptance 
w ithin the relevant scientific community. 

Problems with expert testimony are endemic 
to our legal system and to those of other countries 
whose codes have been derived from common 
law. As far back as one can check, jurists and 
laypersons alike have viewed expert witnesses as 
untrustworthy, as inclined to put a “spin" on 
interpretations of the data toward the side that 
hired them: 

Perhaps the testimony which least 
deserves credit with a jury is that of 
skilled witnesses. ... It is often 
quite surprising to see with what 
facility, and to what extent, their 
views can be made to correspond 
with the wishes and interests of the 
parties who call them. (Judge John 
Pitt Taylor. 1858. p. 65-69. as 
quoted in Gross. 1991 ). 

These views continue to be expressed b\ Ameri- 
can jurists: 

To put it bluntly, in many profes- 
sions serv ice as an expert witness 
is not generally considered honest 
work. . . . Experts in fields see 
lawyers as unprincipled manipula- 
tors of their disciplines, and law- 
yers and experts alike see expert 
witnesses — those members of the 
learned professions w ho will con- 
sort with lawyers — as whores. 

(Gross. 1991. p. 1115) 

And by British jurists: 

Expert evidence is sometimes 
given by people whose level of 
know ledge seems lamentably low . 

A number of the recent, and best- 
known scandals show this. . . . 

I low does this come about'.’ In the 



first place. 1 think it is because our 
present system provides no sys- 
tematic quality control. Broadly 
speaking, anyone can be an expert 
witness, provided they have some 
relevant knowledge, and nothing 
whatever is done to see that only 
the best people are used. To be 
allowed to give expert evidence, 
witnesses must satisty the judge 
that they have some practical ex- 
perience. or some professional 
qualifications: but that is all. No 
minimum standards are laid dow n. 

The only test is opposing counsel's 
cross-examination: and. in a jury 
trial, this may be designed to score 
clever points, rather than to test 
whether they (i.e.. the experts) are 
really good at their job. (Spencer. 

1 qi)2. pp. - lb-2 1 7 ) 

The Relationship of Research to ( littieal 
Practice 

That the judicial community is unwilling or 
unable to evaluate critically the testimony of so- 
cial science experts and mental health profession- 
als reflects to some degree an incomplete or inac- 
curate understanding ol the relevant knowledge 
base, but it also reflects, in the case of the reliabil- 
ity of children's reports, a gulf between clinical 
practice and social science research. As a result, in 
those cases where clinical practice is not into! med 
by research findings, clinicians and social scien- 
tists may present diametrically opposite expeit 
testimonv on the very same topic. Two examples 
illustrate this breach between the two peispec- 
tives. 

The first involves a survey ot 212 mental 
health professionals about their assessment and 
\ alidation proccduresin sexual abusecascs(( ontc 



et al.. 1 99 1 ). Of relevance to the present report, it 
was found that children had already been asked to 
tell their story an average of 2.3 times before 
talking to the professional respondent; only 21 c /c 
of respondents indicated that they were the first 
person to talk with the child about the abuse. In 
discussing these findings, however, the authors do 
not seriously consider the impact ot such inter- 
viewing practices: 

Little is currently known about the 
effects of such prior interviewing 
on the child's willingness to en- 
gage with yet another adult or on 
the quality of information obtained 
from the child. Whi le some profes- 
sionals are likely to make much ot 
the possible "contamination" that 
these prior interviewers have on 
the child's reports, there are virtu- 
ally no data currently availablesug- 
gesting that adults have the power 

through interviewing techniques to 

alter fundamentally a child s un- 
derstanding ot and ability to de- 
scribe what events did or did not 
take place, (p. 433). 

We hope that this report will begin to intorm 
professionals that such data ate available. 

A second example that illustrates the gull 
between practice and research concerns the use ol 
anatomically correct dolls. Many professionals 
have no formal training or experience in the use ot 
the dolls (Boat & Everson. 1988) and may view 
some interactions of children with the dolls (e.g.. 
placing a finger in the doll s anal cavity, tugging 
on its penis, or avoiding the dolls altogether) as 
indicative of sexual abuse, e\ en though there is no 
scientific support that such interactions are diag- 
nostic of abuse. In a recent survey, for example, 
onlv lb</, of mental health and law professionals 
stated that avoidance of the dolls was normal. 



while 807 rated digital penetration as abnormal 
(Kendall-Taekett. 1991 ). Yet. as reviewed above, 
such behaviors are commonly observed in 
nonahused children. 

01 more concern, perhaps, is the American 
Psychological Association's ( 1991 ) current posi- 
tion on the use of the dolls. The following state- 
ment was issued by APA's Council of Represen- 
tatives: 

Neither the dolls nor their use are 
standardized or accompanied by 
normative data. . . . We urge con- 
tinued research in quest of more 
and betterdata regarding the stimu- 
lus properties of such dolls and 
normative behavior of abused and 
nonahused children. . . . Neverthe- 
less. doll-centered assessment of 
children, when used as part of a 
psychological evaluation and in- 
terpreted by experienced and com- 
petent examiners, may be the best 
available practical solution for a 
pressingand frequent clinical prob- 
lem. (APA. 1991. p. I ). 

1 he APA s policy position seems contradic- 
tory in its noting first that there are no standardized 
methods for doll interviews or normative data on 
nonahused or abused children's dol I play, but then 
asserting that experienced interviewers mav nev- 
ertheless find doll-centered assessment the best 
available method forevaluating children suspected 
of having been sexually abuse. Hven if one as- 
sumes that experienced examiners can avoid mak- 
ing false inferences from children's doll play, and 
that such doll play can provide important clinical 
insights not obtainable from other sources, the 
APA should nevertheless codify this expert knowl- 
edge in such a way that researchers can accurately 
assess the incremental \ alidity of doll-based as- 
sessments. Our reading of the literature is that at 



present such knowledge is more illusory than real 
(see Wo I fner et al.'s criticism [ 1 993 J of the lack 
of i nc t e mental validity of dol I -based assessments ). 
I:\en it anatomical dolls are used as just one part 
of an assessment, other aspects of so-called “de- 
velopmentallv sensitive assessments'' (e.e.. play 
therapy, role playing, techniques that induce visu- 
ally-guided imagery, self-empowerment training) 
may interact with the doll use to produce false 
positive assertions of abuse. Because the appro- 
pi iate research has yet to be done, it is short- 
sighted to assume (as some experts have testified 
in court) that the dolls do not present reliability 
risks. Although it could be the case that the use of 
dolls does provide important information, it could 
also be the case that this method leads to unaccept- 
able levels of false positive reports. Only research 
will tell. 

The fact remains that clinicians and mental 
health ptofessionals face many dilemmas and 
choices in providing for children who may have 
been sexually abused. Often the favored choice 
may conflict with forensic procedures. Let us 
consider one scenario: a child has been removed 
from her home as a result of a report of sexual 
abuse and has been placed in emergency foster 
care, separated from her family, friends, and school. 
The child is greatly distressed and in need of 
immediate counseling. The forensic interviews 
will not be completed for several months. In light 
of some research findings that children's reports 
are likely to be more accurate if interviews ( which 
include therapy sessions) concerning the aliened 
abuse are held to a minimum until after the foren- 
sic inlet view takes place, when should the mental 
health professional begin therapy with the child? 
How can we avoid the twin dangers of. on the one 
hand, putting the child’s emotional needs on hold 
until afterthe forensic interview sand, on the other 
hand, providing counseling that can he potentially 
damaging to the veracity of the child's report ? We 
know of no easv answers. 



Given the pressing needs of both sides in a 
criminal dispute to prepare, investigate, and often 
reinterview, no amount of child-friendly court 
procedures can totally alleviate some of the prob- 
lems associated with children's testimony. Yet. 
perhaps there are ways of providing therapeutic 
support that lessen the likelihood of tainting the 
child's report. Therapeutic procedures that in- 
volve visually-guided imagery in the context of 
the abuse-related allegations might be avoided, as 
might forms of therapy that make contact with the 
abusive scenario (e.g.« self-empowerment train- 
ing. role playing, doll use. hypnosis). 

Although some might argue that it would be 
too restrictive and ultimately damaging to a child's 
development were therapists to avoid potentially 
suggestive techniques, it could also be argued that 
employing such interventions simply constitutes 
too great a risk. On the one hand, if the defendant 
is innocent, such techniques could promote and 
reinforce false allegations. On the other hand, if 
the defendant is guilty, these interventions may 
end up discrediting the child's testimony, with 
defense attorneys arguing that the child's reports 
are the product of highly suggestive therapeutic 
techniques. Finally, on the empirical side, we are 
unaware of any persuasive treatment— outcome 
validity research indicating that suggestive tech- 
niques are necessary in therapy to achieve a posi- 
tive mental health outcome for children suspected 
of being abused. Given this state of knowledge, 
clinicians might consider limiting interventions to 
//////suggestive techniques in therapy until young 
clients have given sworn statements: such an ap- 
proach may afford minimal danger to the child. 

Professional Organizations as Ethical 
(iatekeepers 

Professional organizations could help resol ve 
some of the problems we have been discussing by 
making ethics codes for expert witnesses more 



explicit. Existing codes forexpert witnesses of the 
organizations that represent various constituen- 
cies (psychology, social work, pediatries) tend to 
be weak and ill-defined — in part because "expert 
witness" is an ill-defined legal concept.*’ The Fed- 
eral Rules of Evidence 702 states that if scientific, 
technical, or other specialized knowledge will 
assist a fact finder in understanding evidence, then 
a witness may be regarded as an expert by virtue 
of his or her knowledge, skill, experience, train- 
ing. or education. This rule construes expertise 
broadly enough to coverall fields, includingemerg- 
ing areas within fields, and is constrained by two 
other Federal Rules of Evidence (401 and 403). 
which specify that the expert testimony must be 
relevant. Together, these Federal Rules allow v ir- 
tually anyone who possesses an advanced degree, 
or who has some clinical experience, to offer 
expert testimony on children's credibility, even 
though the expert may have scant knowledge of 
the current scientific findings. As a result, experts 
testifying in child sexual abuse eases have offered 
totally opposite interpretations of children's be- 
havior and testimony (see Mason 1 1091 | above). 

To some extent, weak ethics codes also re- 
flect the ascendancy of guild interests. Because no 
constituency wants to be excluded from activities 
that involve service to others (at times for financial 
gain), its representatives ensure that its members' 
role is not diminished by ethics code language. 
Ethics codes tend to be explicit about matters that 
are relatively benign to the group as a whole (e.g„ 
rules for preparing reports, or statements regard- 
ing generic conflicts of interest), but vague about 
matters that could adversely affect the entire mem- 
bership (c.g.. defining precisely what an expert 
should know in order to testify about children's 
suggestibility, or what it means to conduct a good 
interview). Thus, for example, when psycholo- 
gists look to their own specialty guidelines and 
general ethics codes for guidance about the cre- 
dentials or conduct of an expert witness, they find 



little help other than enjoinders to act responsibly, 
to be informed, and to aspire to the norms that 
guide a professional toward the highest ideals. 
Consider some of the sections of the most recent 
APA code of ethics revision (Ethical Principles. 
092) relevant to forensic services: 

• Psychologists appropriately take into 
account the ways in which a prior 
relationship might affect their pro- 
fessional objectivity or opinions and 
disclose potential conflict to the par- 
ties. (Section 705) 

• Psychologists who engage in. . .pro- 
fessional activities maintain a rea- 
sonable level of awareness of scien- 
tific and professional information in 
their fields. (General Standards 1.5) 

• Psychologists rely on scientifically 
and professionally derived knowledge 
when making scientific or profes- 
sional judgments. (General Standards 
I .b) 

• In addition, psychologists base their 
forensic work on appropriate know l- 
edge of a competence in the areas 
underlying such work. (General Stan- 
dards 7 . i r 

• In forensic testimony and reports, 
psychologists . . . describe fairly the 
bases for their testimony and conclu- 
sions | and | whenever necessary to 
a\oid misleading, acknowledge the 
limits of their data or conclusions. 
(General Standards 7.4). 

However well intended, these statements 
taken together lend themselves to ambiguous in- 
terpretation. For example, can a psychologist, in 
testifying about children's suggestibility, rely on 
research knowledge wclinical experience? 
( 'an a therapist be expected to a\ oul a conflict of 



interest and maintain sufficient objectivity to serve 
as an expert witness when he or she has had 
extended contact with the child? (Apparently so. 
Mason's analysis [ 1991 1 showed that many ex- 
pert witnesses who testified in abuse cases were 
often the child's therapist, and only 1 5 ( /c of all 
experts had no prior relationship with the child.) 

Missing from ethics codes and specialty 
guidelines for expert witnesses (e.g.. Committee 
on Ethical Guidelines, 1991) is language that 
would specify that they bring to court more than an 
advanced degree, a supervised internship which 
had brought them into contact with sexually abused 
children, or other clinical experience whereby 
they had occasionally seen sexually abused clients 
in their practice. An expert testifying on children's 
suggestibility, and more generally on the credibil- 
ity of child witnesses, should be intimately famil- 
iar with the systematic scholarship on the topic. 
Although it is not necessary for this expert to be a 
researcher, he or she needs to be at least a critical 
consumer of the research literature. 

The failure of professional organizations 
to constitute and then to enforce principled 
guidelines has serious consequences. First, it 
can undermine the judicial system's confi- 
dence in the capacity of professionals to 
offer reliable testimony. More important, in 
criminal proceedings where the defendant 
faces incarceration, or in civil proceedings 
w here the future placement of the child is at 
stake, the legitimacy of the expert's testi- 
mony can be critical to preserving the rights 
of both the child and the defendant. 

To conclude, enforcement mechanisms 
arc needed to ensure that expert testimony 
can be evaluated for its scientific merit. 

I Util such mechanisms arc openly adver- 
tised to all consumers of legal services, 
enjoinders to 'May informed" will probably 
do little to ebb the sorts of abuses reported 




by Mason ( 1991 ) and Spencer ( 1992). 

Because of their vagueness, professional 
ethics codes will be implemented more often 
in the breach than in the letter. 

Conclusion 

We have argued that the investigation of 
child sexual abuse allegations and expert testi- 
mony addressing such investigations are fraught 
with problems. Scientists have begun to contrib- 
ute important insights to these problems, though 
clearly more research is needed. We have pro- 
vided some troubling examples of how research 
has failed to inform practice, and how experts 
often go beyond what current scientific findings 
seem to warrant. 

To be sure, those charged with investigating, 
reporting, and treating suspected child maltreat- 
ment face immense obstacles. These profession- 
als are deeply aware of the pervasiveness of child 
sexual abuse, and the all too frequent ineffective- 
ness of prosecution. They know better than most 
the emotionally wrenching sequelae of abuse, 
especially intrafamiiial abuse. And they are keenly 
aware that the evidence from research must al- 
ways be tempered by real-world considerations, 
no matter what the “significance level" or “effect 
si/e" of a finding. Thus, many pressing, unre- 
solved issues concerning the interviewing and 
treatment of indiv idual children remain. 

We presented a scenario highlighting the 
difficult task faced by many professionals on a 
daily basis — how to promote two goals that often 
conflict: how to provide for the child's mental 
health needs while simultaneously protecting the 
legal rights of the accused. While we must strive 
to uncover abuse, we must eschew interview pro- 
cesses that may promote f alse beliefs, fantasies, or 
f abrications -regardless of the nature of the initi- 
ating event. Just as we haveariuied above that it is 



unethical for social scientists to institute experi- 
mental manipulations that might change the fun- 
damental nature of children's emotionally salient 
autobiographical memories, it is equally indefen- 
sible for therapists or forensic interviewers to 
cause such changes. The results of persistent erro- 
neous suggestions and of failures to test alterna- 
tive hypotheses can be lasting, as evidenced by the 
experiences and reactions of the child witnesses in 
the McMartin trials, described at the beginning of 
this report: 

No one who saw them will soon 
forget the frenzied faces of . . . 
former McMartin pupils (who) had 
spent their last six years — fully 
half their liv es — instructed in the 
faith that they had been subjected, 
at ages 4 and 5. to unspeakable 
sexual horrors; this belief they had 
come to hold as the defining truth 
of their lives and identities. It is not 
surprising that these children 
should have wept and raved when 
the verdict was handed down de- 
nying all that they believed in. 
(Rabinowit/.. 1990. p. 63) 

Notes 

In v icw of the public outcry against the seeming refusal by 
jurors to believe the chiklr n. posttrial statements by these 
same jurors about believing some of the children’s claims 
nuiv have been self-serving. One close observer of the trial 
suggested this possibility to us. 

Several have been the focus of books. The Wee fare ease 
involving Kelly Michaels was the source lor several books, 
including Xti/niinc (Manshel. 1990) and Xoi My ( hilil 
tf row lev. 1990). Other eases have been detailed in teles i- 
sion documentaries (c.g.. the Little Rascals ease involving 
hob Kelly and fiv e other defendants, which was the focus ol 
three frontline documentaries, c.g.. /.ovv <>/ Innocence): 
movies (c.g.. the Country Walk day -cate ease in Miami, 
which vs as the basis of the movie ( n\i>C(ik(ible Acts): and 
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magazine and newspaper articles (c.g., Nathan. 1987; 
Rabinovvitz. 1990). 

* Although this report focuses on the interviewing of alleged 
child sexual abuse victims, the literature reviewed is equally 
important to nonabuse cases that involve the child witness. 
Sexual abuse is of special interest, because this category of 
complaints appears to represent the single largest class of 
actions that eventuate in criminal court testimony (as op- 
posed to neglect cases or custody disputes which are largely 
litigated in juvenile and family court systems). Our discus- 
sion centers on nursery school cases, because, although 
these cases represent only a small proportion of sexual abuse 
complaints, in absolute numbers they involve a large num- 
ber of children (in the McMartin case, for instance, inter- 
\ ievvers under contract to the State of California alleged the 
abuse of 369 children 1 Sauer, 1993|): moreover, day-care 
cases are relevant to the more general testimonial issues 
found in many notulay-care cases (i.e„ repeated suggestive 
questioning, interv iewer stereotypes, failure to test alterna- 
tive hypotheses). Finally, because of their visibility . day- 
care cases are often more extensively documented. 

1 Cases are classified as substantiated o r indicated based on 
how consistent the evidence from an investigation is w ith 
abuse; in most states this is a matter of caseworker judgment. 
States usually have a two-tiered system of classifying inves- 
t igat ions as either substantiated/founded, on the one hand, or 
unsubstantiated/not founded, on the other. Some states use 
a third tier that is intermediate between substantiated and 
unsubstantiated, namely . “indicated,** This term is given to 
cases in w hich the agency doing the investigation may have 
“reason to suspect*' that abuse occurred, but the level of 
e\ idence docs not rise to the level required lor the designa- 
tion “substantiated.** The lowest level of evidence needed to 
substantiate a case is “some credible evidence," which is 
used by 1 8 states, while the highest level of evidence needed 
is “preponderance of ev idence." which is used by 12 states. 
An intermediate level of evidence is used by an additional 1 2 
states, and the remaining states use idiosyncratic terminol- 
ogy (see Figure 5 of National (‘enter of Child Abuse and 
Neglect, 1993. p. 28). Thus, lower levels of ev idence in- 
crease I lie possibility that, upon further investigation, a 
subsequent determination may be made that insufficient 
ev i tie nee exists to designate the presence of abuse, For these 
icasons it is important not to use the terms “substantiation," 
“indication." and “validation" interchangeably. 

' Some have suggested that these two figures be summed to 
S'; . This assumes, however, that there were different chil- 
dren in the two categories, which is not clear from the 
published report. The breakdown reported here is that re 
ported In Say wit/, et al, ( 1990), 



' Courts have taken notice ol the need iti distinguish betw een 
an interviewer whose view reflects a strongly held expecta- 
tion versus the interviewer who possesses relevant back- 
ground information. For example, in Idaho v. Wright the 
Court accepted the argument contained in an amicus brief 
that “there is an important distinction between preconcep- 
tions that can cloud judgment, and background information 
that is needed for thorough evaluation of sexual abuse** 
(Amicus Brief to the Supreme Court in Idaho v. Wright, No. 
89-260. p. 96). 

These data reveal an interesting disjunction with the rea- 
soning that when children retract earlier claims of sexual 
abuse, this is indicative, if not diagnostic, of a truthful 
original report (Sgroi. 1982; Summit. 1983 ). In (his study, it 
was often the ease that children originally made false allega- 
tions. which they then, with gentle persuasion, recanted. 
Were this finding applicable to situations that are abuse- 
related — and we make no such claim here — it could be 
suggested that retraction might also be consistent with an 
erroneous original report. 

' Pilot data from one subject addresses this question. A 
37: -year-old nonabused girl was examined by a pediatri- 
cian. She w as not given a genital examination. Immediately 
after the examination, when interviewed by the experi- 
menter, she correctly said that the doctor had not touched her 
genitals or buttocks. Furthermore, when show n an anatomi- 
cally correct doll and told to show how the doctor had 
touched her genitals and buttocks, she correctly stated that 
lie had not touched her. Three day s later, the same child w as 
shown the anatomically correct doll and asked to show all 
the things that the doctor did to her in her prev ious \ i sit. This 
time, she inserted a stick into the vagina of the doll. I’pon 
I’urthcrquestioning. however, she said that the doctor did not 
do this. Three more day s later, the child vv as asked to use the 
anatomically correct doll to show her father every thing that 
had happened at the examination. ’Phis time, she hammered 
a stick into the doll's v agina and then inserted a toy eaiscope 
into the doll's anus. When asked if this really happened, she 
said 'A cs it did." When her father and the experimenter both 
tried to debrief her with such statements as. “Your doctor 
doesn't do those tilings to little gil ls. You were just fooling. 
We know lie didn't do those tilings." the subject clung 
tenaciously to her claims. Thus, for this one subject, 
repeated exposure to the doll, with minimal suggestions, 
resulted in highly sexuali/ed play. It is critical that such a 
finding he replicated with a large, diverse sample to deter 
mine if this child's response is representative of nonabused 
children, 

Toi example, most coutts disallow expert testimony dun 
speaks directly [nihe ulnnunct/He\fu>nA\\ii\ is. thedeleiulants 



guilt or innocence. In some courts, however, expert wit- 
nesses are permitted to testify as to whether they believe the 
chi id was abused ( see My ers. 1 992 ). One would think that an 
expert's opinion that a particular child was abused might 
have the same effect as speaking to the child's credibility. 
This leads to confusion even among the legal scholars whom 
we have consulted, with one remarking that the courts' 
thinking regarding this issue is little more than “wordplay 
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Background 

Over the last two decades child development 
research has reflected an ever growing concern 
with the daily challenges facing vulnerable and 
socially disenfranchised children and their fami- 
lies (Fisher & Lemer, 1994; Fisher et al., 1993; 
Fisher &Tryon, 1988, 1990). As developmentalists 
seek to apply the scientific method to describe, 
explain, and enhance the status of disadvantaged 
children and youth, they are encountering ethical 
dilemmas to which current guidelines offer in- 
complete answers (Fisher & Rosendahl, 1990). 
The scientific and ethical dimensions of this re- 
search often appear to have mutually exclusive 
goals. Whereas the science dimension involves a 
search for truth through experimental controls, 
the ethics dimension is directed toward the protec- 
tion of participant autonomy and welfare through 
means that often appear to jeopardize controls. 



Researchers must confront questions such as 

• how to balance the need to secure a 
valid sample with the need to protect 
the autonomy of potential subjects 
who may need assistance or be espe- 
cially vulnerable; 

• how to handle researcher-participant 
confidentiality when problems such 
as illicit or abusive circumstances are 
uncovered in the course of conduct- 
ing research; 

• how to take an active part in the 
dissemination and application of re- 
search findings in ways that avoid a 
misinterpretation or overgeneral- 
ization of data that may jeopardize 
the development of sound public 
policy and programs. 
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Traditionally trained developmentalists are 
findings themselves unprepared to resolve dilem- 
mas like these. As a consequence, the integrity 
and adequacy of ethical directives are at the fore- 
front of deliberations about scientific practices as 
they apply to high-risk children and youth. 

Federal and Professional Guidelines for 
Research with Children and Youth 

Formal guidelines for ethical practice in the 
conduct of research with human subjects have 
their origin in the Nuremberg Code (1946), a set 
of international guidelines developed in response 
to the atrocities of Nazi medical experimentation. 
Since then, federal agencies and professional or- 
ganizations have developed ethical codes aimed 
at providing standards of conduct for research 
with adults and minors (e.g.. Protection of Human 
Subjects , Department of Health and Human Ser- 
vices |DHHS], 1991; Ethical Principles of Psy- 
chologists and Code of Conduct, American Psy- 
chological Association. 1992; Ethical Standards 
of the American Educational Research Associa- 
tion, 1992; and Ethical Standards for Research 
with Children , Society for Research in Child De- 
velopment [SRCD], 1993). 

One of the most detailed ethical guidelines 
for behavioral research, often referred to as the 
Belmont Report, was presented by the National 
Commission for the Protection of Human Sub- 
jects of Biomedical and Behavioral Research (De- 
partment of Health, Education, and Welfare 
|DHEW|, 1978). In addition to suggesting spe- 
cific ethical standards for human subjects research, 
the report articulated three moral principles upon 
which ethical conduct should be based: benefi- 
cence, respect , and justice. These principles un- 
derscore the researcher’s obligation to protect 
participant welfare, to respect participant autonomy 
and privacy, and to ensure the equitable distribu- 



tion of benefits and costs associated with research 
participation. Today these three moral values are 
considered the foundation of ethical decision mak- 
ing in research with children and adolescents 
(e.g., Beauchamp & Childress, 1979; Fisher & 
Tryon, 1990; Gaylin & Macklin, 1982; Koocher 
& Keith-Spiegel, 1990). 

Beneficence. The principle of beneficence is 
embodied in the maximization of possible ben- 
efits and minimization of possible harm (DHEW, 
1978). Beneficial outcomes can be accomplished 
through experimentation directly or indirectly. 
For example, research designed to evaluate the 
efficacy of a developmental intervention has the 
potential to benefit directly the child or adolescent 
assigned to the “treatment” group. On the other 
hand, a study designed to describe the develop- 
mental consequences of high-risk conditions of 
childhood and adolescence has the potential to 
benefit indirectly the participant's social group 
through future application of research findings by 
parents, practitioners, or policymakers. 

Research can also expose participants di- 
rectly or indirectly to experiences that result in 
serious or permanent harm. For example, parents 
of children assigned to a “no treatment” control 
group may forego service options for their chil- 
dren available outside the experimental setting 
(Fisher, 1991; Fisher & Rosendahl, 1990). Alter- 
natively, experimental procedures or assessment 
strategies may inadvertently elicit stress in child 
and adolescent research participants. It is because 
of such contingencies that most professional codes 
and federal regulations require investigators to 
weigh the potential balance of harms and benefits 
to participants before deciding to conduct re- 
search (e.g., see Principle 1, SRCD, 1993). At 
present, however, researchers, institutional re- 
view boards (IRBs), and policymakers have little 
guidance in determining the acceptable risk and 
benefit balances. 
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Respect. The principle of respect reflects a 
moral concern that individuals be treated as au- 
tonomous agents and that more vulnerable per- 
sons with diminished autonomy have these rights 
protected (DHEW, 1978). This principle is re- 
flected in professional codes and federal regula- 
tions that require that potential subjects be fully 
informed about the nature of the research, what 
will be required of them, the potential risks and 
benefits of participation, and their right to with- 
hold or withdraw consent without penalty (Faden 
& Beauchamp. 1986; see Principle 2. SRCD, 
1993). 

Consent procedures for research participants 
who are minors require special consideration for 
several reasons. Children and adolescents do not 
have the legal capacity to consent (Melton. 
Koocher. & Saks. 1983). They may lack the cog- 
nitive capacity, depending on the age of the child 
and complexity of the situation, to comprehend 
the rationale or nature of the experimental proce- 
dures (see Thompson. 1990). And they may per- 
ceive they lack powerto refuse participation (Fisher 
& Rosendahl. 1990; Keith-Spiegel. 1983; Koocher 
& Keith-Spiegel. 1990; Levine. 1986). In light of 
these issues, federal regulations and professional 
codes (e.g.. see Principle 3. SRCD. 1993) require 
the informed consent of parents, legal guardians, 
or those who act in loco parentis before a child can 
participate in research. In addition, special child 
assent procedures are recommended for children 
older than 7. According to the SRCD Ethical 
Standards for Research with Children, "Assent 
means that the child shows some form of agree- 
ment to participate without necessarily compre- 
hending the full significance of the research nec- 
essary to give informed consent” (see Principle 2. 
SRCD, 1993, p. 337). The objection of a child of 
any age is binding unless the experimental treat- 
ment holds out the prospect of direct benefit that 
is important to the child and only achievable 
through the research. 



Justice. The principle of justice refers to the 
moral value placed on equitable distribution of 
social benefits and costs (Rawls. 1971). This prin- 
ciple, as articulated in federal and professional 
codes for human subjects research, requires equi- 
table selection and treatment of participants re- 
gardless of nationality, race, age, sex, or social 
status. Justice is expressed in the research design 
of developmental studies when investigators man- 
age to maintain a balance between including di- 
verse populations and remaining sensitive to the 
possibility of differential impacts. From this per- 
spective, it is important to include disadvantaged 
and minority populations in developmental re- 
search, since not to do so runs the risk of depriving 
society of a know ledge base from which suitable 
and effective services can be constructed (Fisher, 
1991; Fisher & Brennan. 1992; Graham, 1992; 
Hagen. Paul. Gibb. & Wolters. 1990; McLoyd & 
Randolph. 1984). 

Overview 

The principles of beneficence, respect, and 
justice can serve as valuable guides for research- 
ers who work with vulnerable populations. This 
report explores the implications of these prin- 
ciples for such research in all its phases — recruit- 
ment. implementation, and the dissemination ot 
findings. The first section provides an overview of 
the ethical challenges associated with informed 
consent procedures for research with children and 
youth. Included are an exploration of the informed, 
rational, and voluntary requirements of consent to 
participate in descriptive and intervention research, 
and of the conflicts that arise when guardian 
consent is unavailable or not in the best interests 
of the child. The second section addresses the 
investigator's responsibilities when problems that 
may jeopardize the minor participant’s well-be- 
ing are discovered during the course of research. 
The researcher must consider the validity of risk 



estimates, how and with whom information should 
be shared, and the federal and state laws regulat- 
ing reporting responsibilities. Finally, the third 
section deals with the ethical concerns associated 
with generalizing the results of research based on 
large social units to individuals and diverse popu- 
lations. Included is discussion of the ethical impli- 
cations of overestimating the applied relevance of 
developmental research, of failing to consider 
individual and cultural differences when design- 
ing and evaluating social programs, and of apply- 
ing the developmental knowledge base to legal 
decisions relevant to the welfare of children and 
youth. The report concludes with recommenda- 
tions for future policies. 

Securing the Sample: 

Informed Consent for Participation in 
Descriptive and Intervention Research 

Informed consent, as currently required by 
federal and professional ethical guidelines, is seen 
by many as the primary means of protecting par- 
ticipants. Consent procedures are designed to pro- 
tect participant autonomy and self-determination 
by ensuring that the decision to participate is 
informed, rational, and voluntary (Freedman, 
1975). Participant rights are further safeguarded 
in the case of children and adolescents through the 
requirement of parental or guardian consent. In 
addition, to ensure that the rights and privacy of all 
persons involved in the research are protected, 
informed consent is obtained from all involved 
individuals. A teacher’s consent may be required, 
for example, when the focus of an investigation is 
the student-teacher interaction (see Principle 4, 
SRCD, 1993). 

The Informed Requirement of Consent 

The informed aspect of consent refers to the 
requirement that prospective participants be pro- 



vided with information regarding all procedures 
that might influence their willingness to partici- 
pate themselves or allow their child to participate. 
Such information includes a description of the 
procedures, the time required to participate, the 
profession and institutional affiliation of the 
investigator(s), foreseeable risks or benefits of 
participation, the extent as well as limits of confi- 
dentiality, the voluntary nature of participation, 
and procedures for obtaining results and conclu- 
sions of the research. For both guardian consent 
and child assent, the informing procedure should 
be appropriate to the language usage and level of 
comprehension of parent and child (Fisher & 
Rosendahl, 1990; Thompson, 1990). In addition, 
the researcher should answer all questions posed 
by participants or their guardians. 

The degree to which parental consent is truly 
informed can be affected by the specific research 
question. For example, in a study exploring the 
developmental correlates of maternal depression, 
the investigator may need to take into account the 
potential impact of symptoms associated with 
depression, such as difficulties in concentration or 
feelings of worthlessness, on the mother’s capac- 
ity to understand the nature of the study and to 
respond in her own best interest and that of her 
child. 

The Rational Nature of Consent 

The rational requirement of consent proce- 
dures reflects an acknowledgement that the con- 
text in which parents of children at risk are re- 
cruited may work against a rational approach to 
consent. The very conditions that identify the 
child as a potential participant may make it diffi- 
cult for parents to recognize the limited or nonex- 
istent benefits of participation in research meant 
to be descriptive rather than intervening. 



For example, a family with a child who 
exhibits a pattern of disabilities but who has no 
definitive diagnosis or treatment may look to any 
contact with professionals as potential assistance; 
and the family may assume erroneously that some 
benefit will come of participation (Carroll, 
Schneider, & Wesley, 1985; Fisher& Rosendahl, 
1990). Members of vulnerable families recruited 
to assess the efficacy of a developmental interven- 
tion may mistake descriptions of potential ben- 
efits of participation for presumed benefits. Thus, 
recruitment procedures for participation in such 
researc h must ensure that participants and/or guard- 
ians fully understand the nature and risks of par- 
ticipation, including the potential consequences 
of random assignment to a control condition that 
has been empirically associated with later devel- 
opmental problems. 

Investigators studying development in low'- 
risk populations must also be sensitive to uninten- 
tional violations of the rational aspect of informed 
consent. For example, a procedure may itself have 
potentially stressful consequences (Fisher & 
Rosendahl, 1990). As an illustration, consider a 
study designed to follow the development of chil- 
dren with social problems that includes a screen- 
ing of kindergartners to identify those high in 
aggressive behaviors or rejected by peers. Let us 
say then that these “aggressive” and "rejected" 
kindergartners are to be the focus of a second 
phase of the study. Some parents of the “aggres- 
sive" children selected for this second phase may 
become concerned that their child has “special" 
problems, and they may misperceive participation 
in the research as a means of helping their child or 
of providing ongoing “diagnosis" of the child's 
"condition." Informed consent procedures that 
draw upon our knowledge of affective factors 
associated with parenting can help to ensure that 
guardian consent is both informed and rational. 



The Voluntary Aspect of Informed Consent 

Informing participants. Investigators who 
study the development of disenfranchised chil- 
dren and their families must be particularly sensi- 
tive to potential violations of the voluntary aspect 
of guardian consent (Fisher & Rosendahl, 1990). 
Vulnerable families, contacted while seeking ser- 
vices at community mental health or medical 
facilities, may be concerned that failure to consent 
will result in a discontinuation of services for 
themselves or for their children (Fisher, 1991). 
Under these circumstances, special care needs to 
be taken to clarify that the right to services is not 
contingent on consent, and that the child can 
withdraw even after consent has been given. 

Some participants and their guardians re- 
cruited for intervention research may prefer either 
the treatment or matched control condition to 
random assignment. The traditional elimination 
of these individuals from the subject pool may 
compromise the voluntary nature of informed 
consent. Some individuals desperate to help them- 
selves or their children may consent to random 
assignment because they see the chance of receiv- 
ing services as preferable to receiving no services 
at all. This procedure may also limit the 
generali/.ability of findings to families who are 
willing to be randomly assigned (Fisher, 1991; 
Fisher & Rosendahl. 1990). For example, adoles- 
cent mothers who prefer nonrandomized assign- 
ment to the treatment group of a parenting skills 
intervention study may have the personal resources 
to benefit most from this type of intervention. A 
study finding no treatment effects for young women 
willing to be randomized might lead to the errone- 
ous conclusion that such a program was ineffec- 
tive for all teenage mothers. A novel approach to 
this problem, the semi-randomized clinical trial 
( Veatch. 1 987), assigns participants to four groups: 
two groups comprised of individuals who have 
agreed to random assignment to either the treat- 



mcnt or control condition, and two groups who 
have selected nonrandom assignment to the two 
conditions. This design respects participant au- 
tonomy and enables researchers to compare the 
responses of participants who agree to randomiza- 
tion with those who do not. 

Incentives. To ensure that parental judgments 
are rational, voluntary, and uncoerced, investiga- 
tors studying at-risk populations need to take 
special precautions when offering inducements 
for research participation, on the grounds that 
undue incentives can invalidate informed con- 
sent. The decision to provide inducements creates 
a tension between compensating individuals fairly 
for their time and inconvenience and subjecting 
them to undue coercion to participate in proce- 
dures to which they might not otherwise consent. 
This is particularly true in the case of the recruit- 
ment of impoverished persons, who. because they 
need the income, may be willing to assume, or 
havetheirchildren assume, extraordinary burdens 
(Levine. 1986). Unfortunately, at present there is 
little consensus on what defines due and undue 
incentives for research participation (Macklin. 
1981 ). 

In making decisions about the use of incen- 
tives. the investigator must consider the ethical 
principle of justice, that is. how the particular 
inducement will affect the equitable distribution 
of both the burdens and benefits of research par- 
ticipation. Ideally, offering incentives should en- 
able a balanced sampling of individuals from all 
socioeconomic levels. However, the varying eco- 
nomic circumstances of potential participants will 
lead to vary ing perceptions of a cash inducement. 
One response to this dilemma has been to take a 
market approach to incentives, whereby partici- 
pation is viewed as an ordinary job requiring 
relatively unskilled labor, and payment is deter- 
mined at a level sufficient to attract the desirable 
number and diversity of research participants 
(Levine, 1986). 



Offering incentives for child and adolescent 
participation requires additional consideration. 
For example, establishing procedures whereby 
cash payments or services go directly to the minor 
participant rather tnan the guardian has been one 
way to ensure that participants and their guardians 
see the child's efforts as fairly compensated. An 
additional safeguard is to offer inducements that 
do ’’not unduly exceed the range of incentives that 
the child normally experiences” (see Principle 5. 
SRCD, 1993. p. 337). 

The Best Interests of the Child 

Requiring guardian consent assumes that the 
child comes from a reasonably secure family set- 
ting in which the child and his or her parents share 
loving relationships (Gaylin & Macklin. 1982: 
Levine. 1986). However, significant numbers of 
children from high-risk physical or social envi- 
ronments have undetermined custody, nonrela- 
tive guardians, or state guardianship (Gibbs. 1 990: 
Hendren. 1991). In addition, high-risk physical 
and social conditions may in themselves make 
obtaining consent from identified guardians diffi- 
cult. In the absence of specific consent policies 
regarding the "best interests of the child." devel- 
opmental investigators are torn between exclud- 
ing children with uncertain custody or waiving 
parental consent and the participant protection 
such consent is designed to afford. 

Passive consent. Difficulties in acquiring 
guardian consent from at-risk populations has led 
in recent years to renewed debate over the use of 
passive guardian consent. Passive consent is the 
procedure by which guardians are sent forms 
describing the research and are asked to respond 
only if they do not wish their child to participate. 
This procedure does not meet the criterion of 
ethical practice implied by the principle of respect 
for self-governance, in that the researcher cannot 
know for certain that the child's guardian received 
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the information or that the guardian’s failure to 
respond reflects agreement to have the child par- 
ticipate. Recently, investigators have drawn upon 
the principle of beneficence to argue that since 
active consent procedures produce lower rates of 
return for low-income, disadvantaged youth, pas- 
sive consent procedures are justified to ensure that 
these populations will receive the benefits of 
knowledge derived from scientific inquiry. The 
principle of justice points, however, to the ethical 
vulnerability of this argument: the use of passive 
consent with disadvantaged populations means 
that children of low-economic status, in contrast 
to children from middle-income homes, are not 
receiving the equal protection afforded by guard- 
ian consent. 

U nderlying the advocacy for passive consent 
is an implicit assumption that a caring and knowl- 
edgeable guardian would perceive the research as 
important and as desirable for his or her child. 
Equally unsupported, but more unfortunate, is the 
additional inference that disadvantaged parents 
w ho do not return consent forms either lack the 
knowledge to appreciate the importance of the 
research or are unconcerned about their ehild's 
welfare. No empirical data exist to support either 
of these assumptions, nor is there research com- 
paring the response rates for middle-class and 
low-income parents for studies that would have 
comparable meaning to the lives of their children. 
In fact, many people from low-income, ethnic 
minority communities may look upon research 
projects with suspicion. The IQ debate and asso- 
ciated tracking movements in schools, the scandal 
surrounding the Tuskegee syphilis study, and the 
brewing scandal over government radiation ex- 
periments in the 1940s through the 1960s serve to 
undermine trust in social scientists as guardians ol 
ethical treatment when prospective subjects are 
minorities (J. 1 ; . Jackson, personal communica- 
tion, January 6, 1994). Given the paucity of data 
and the potential threats of passive consent to 



participant autonomy and privacy, these unwar- 
ranted assumptions risk substituting investigator 
paternalism forparental permission. Consequently, 
an empirical assessment of who fails to consent 
when active consent is used with different popu- 
lations of high-risk youth, and how we can de- 
velop more effective recruitment procedures to 
increase active guardian consent, should be an 
important part of the nation’s child and adolescent 
research agenda. 

Can adolescents participate in research 
w ithout guardian consent '! The social challenges 
facing today's youth raise questions concerning 
the rights of minors to consent to research partici- 
pation without the knowledge or permission of 
their legal guardians. For example, the require- 
ment that guardians give consent for the research 
participation of adolescents in studies on sub- 
stance abuse or sexual activity, may. rather than 
protect participant rights, violate a teenager's pri- 
vacy or even jeopardize his or her welfare ( Brooks- 
Gunn & Rotheram-Borus. in press). 

In recent years the legal-medical model de- 
fining the rights of adolescents to consent to medi- 
cal interventions has provided a framework for 
evaluating the conditions under which guardian 
consent for research participation might be waived 
(Holder. 1981 ). All states currently allow minors 
to consent independently to treatment for venereal 
disease, and states specify statutory ages (usually 
15 or 16) at which teenagers may consent to 
medical procedures (including, in some states, 
alcohol or drug abuse treatment) without parental 
involvement (Holder. 198 1 ). In that the law grants 
teenagers the autonomy to make decisions con- 
cerning their medical treatment in such instances, 
it seems both reasonable and ethical to grant them 
the same autonomy to decide whether or not to 
participate in research that examines their reasons 
for or reactions to seeking these treatments ( Scarr. 
in press). Moreover, each state has statutes del in- 



ing the conditions of emancipation, such as ado- 
lescent parenthood or independent living arrange- 
ments. which can be used as guides for ethical 
decisions about the rights of minors to give au- 
tonomous consent for research participation. 
Developmentalists should nevertheless guard 
against simply substituting such statutes for more 
principled decisions about guardian consent waiv- 
ers based on developmental considerations. 

Federal guidelines for research with children 
( DHHS. 1 983 ) allow an institutional review board 
(1RB) to determine the conditions under which 
requiring parental consent is not a reasonable 
protection of minor participants. In so determin- 
ing. the IRB can waive parental permission if 
appropriate mechanisms for protecting the child 
are substituted. It would seem, based on this prin- 
ciple. that guardian consent would be waived — 
for instance, in the case of teenage runaways — if 
procedures are established to ensure that the 
adolescent's own consent is informed, rational, 
and voluntary. Researchers must be mindful, 
however, of Robert Veatch's (1981) cautionary 
comments that guardian consent should not be 
waived simply to facilitate the research process. 
Decisions regarding whether adolescents should 
participate in research without parental consent 
should be based on the potential benefits to the 
participant rather than the utility needs of the 
researcher. 

Problems Identified During the Course of 
Research: Sharing, Referring, and Reporting 

In the course of conducting their studies with 
high-risk children, researchers may utilize assess- 
ment instruments or specific methodologies that 
yield information indicating that a research par- 
ticipant has previously undetected cognitive. 



socioemotional. or physical problems. Underthese 
circumstances the principle of beneficence would 
lead the investigator to consider whether to share 
such information with participants and/or their 
parents, to provide information about referral ser- 
vices. or. under extreme circumstances, to report 
the problem to local authorities (Fisher, in press: 
Fisher & Rosendahl. 1990). 

Developmental researchers have tradition- 
ally been reluctant to communicate the results of 
individual assessments to parents, both out of 
concern for confidentiality and a healthy skepti- 
cism regarding the clinical relevance of research- 
derived information (Asher. 1993: Brooks-Gunn 
& Rotheram-Borus. in press: Fisher & Brennan. 
1992). In fact, one recent survey of leading devel- 
opmental journals found that less than 1 % of the 
empirical articles mentioned that information was 
shared with parents or that referrals were made for 
developmental problems discovered during the 
course of research (Fisher & Brennan. 1992). 

According to Principle 9 of the SRCD Ethi- 
cal Standards for Research with Children, 

When, in the course of research, infor- 
mation comes to the investigator's at- 
tention that may jeopardize the child's 
well-being, the investigator has a re- 
sponsibility to discuss the information 
with the parents or guardians and with 
those expert in the field in order that 
they may arrange the necessary assis- 
tance. (SRCD. 1993. p. 338) 

In deciding whether to share research -de- 
rived information about child and adolescent par- 
ticipants, the investigator must consider the valid- 
ity of the risk estimates, how and with whom 
information should be shared, and the federal and 
state laws which regulate the reporting responsi- 
bilities of researchers working with minors. 
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The Validity of Risk Estimates 

Any decision to share information obtained 
during the course of research must include an 
initial consideration of whether the experimental 
procedures provide a valid means of identifying 
the presence of specific developmental disorders 
or risks (Fisher, 1991; in press). Investigators 
must guard against underestimating or overesti- 
mating the predictive utility of developmental 
measures and should proceed with caution when 
making evaluative statements or giving advice 
(see Principle 13. SRCD. 1993). 

Infonnation derived from diagnostic mea- 
sures. Investigators, in examining the develop- 
ment of children of vulnerable populations, often 
utilize standardized measures of individual psy- 
chological functioning. The Wechsler Intelligence 
Scale for Children ([WISC-R], Psychological 
Corporation, 1991 ), the Child Behavior Checklist 
(iCBCL|, Achenbach & Edelbrock. 1983), and 
the Children's Depression Inventory ([CDIj, 
Kovacs. 1981 ) are commonly used to assess tie 
cognitive or emotional status of child and adoles- 
cent research participants. To the extent that these 
assessments have diagnostic validity, the re- 
searcher may come to possess information about 
potential conditions that jeopardize the child’s 
well-being. For example, a pattern of WISC-R 
responses may suggest a learning disability; the 
CD1 may yield information indicating childhood 
depression; and maternal reports on the CBCL 
may indicate anxiety disorders of childhood or 
adolescence. When this happens, investigators 
may bear a responsibility to share information in 
order that the child obtain needed assistance (see 
Principle 9, SRCD. 1993). 

Information derived from descriptive mea- 
sures. Some assessment instruments that are valid 
for identifying group differences in the experi- 
mental setting may not have the psychometric 



properties necessary fora meaningful assessment 
of individual abilities or disabilities (Fisher, in 
press; Fisher & Brennan, 1992). Sharing with 
parents such research-derived observations about 
a minor participant runs the risk of creating stress- 
ful or harmful misconceptions about the child's 
development. 

For example, while adolescent experimenta- 
tion with alcohol has been associated with depres- 
sion, school problems, delinquency, and unpro- 
tected sex (Kandel, Davies, Karus, & Yamaguchi, 
1986), we do not yet have sufficient information 
about either the degree of alcohol consumption at 
which such effects are seen or the specific person- 
ality characteristics or social factors involved to 
determine whether a particular adolescent will 
progress to abuse of alcohol during the teenage 
years or later in life (Pentz, 1994). Given the lack 
of empirical evidence documenting individual risk 
for teenage alcohol use, sharing information on 
adolescent drinking behaviors with parents or 
discussing the implications of specific responses 
with the adolescents themselves could be poten- 
tially misleading and outside the boundaries of 
good ethical practice for developmental scien- 
tists. On the other hand, as outlined in the next 
section, in the absence of diagnostically valid 
measures of individual functioning, researchers 
may wish to incorporate more general informa- 
tion about psychological or social services into 
their standard debriefing procedures (Fisher & 
Rosendahl, 1990). 

Clarifying the Limits of Communication end 
Referral Procedures 

Once an investigator has determined the di- 
rect or indirect relevance of empirical observa- 
tions to participant welfare, she or he must decide 
what type of information will be shared and with 
whom. Advance planning is the best means of 
establishing communication and referral procc- 
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dures that will be well received by participants 
and their guardians and can be successfully imple- 
mented. 

Clarifying the extent and limits of communi- 
cations as part of consent procedures can help 
establish realistic participant and guardian expec- 
tations about information the investigator can and 
cannot provide (Fisher, in press). For example, 
researchers should not promise children or ado- 
lescents confidentiality if they are prepared to 
share some portion of the child's experimental 
responses with parents (Asher. 1993; Brooks- 
Gunn & Rotheram-Borus, in press). In contrast, if 
information is not to be shared, investigators should 
not give parents the impression that participation 
in the research will advance their knowledge about 
theirchild. As noted in the previous section, speci- 
fying the limits of information that can be shared 
helps ensure that guardian consent is both in- 
formed and rational. 

Providing Assessment Results 

When an investigator has determined that it 
would be to a child's advantage to share valid 
diagnostic information with parents, she or he 
must ensure that test administration, interpreta- 
tion, and communication of results meet current 
standards of ethical practice. Research personnel 
must be competent to administer and score tests 
appropriately. In interpreting results they must 
take care to identify situations in which particular 
assessment norms may not be applicable to indi- 
vidual diagnoses (e.g., for children from language 
communities or cultural groups not included in 
the population on which the test was standard- 
ized). 

Research personnel must also be trained to 
explain the assessment results to parents in lan- 
guage that is understandable. Moreover, if infor- 
mation is to be shared with individuals other than 



the child's guardian (e.g., the child’s pediatrician 
or academic advisor), the investigator should en- 
sure that these individuals are qualified to use and 
interpret psychological assessments. Information 
should never be shared with other professionals 
without the explicit, written consent of the guard- 
ian (Fisher, in press). 

Providing Referral Information 

In recent years, developmental researchers 
have generated information on the risk and pro- 
tective factors associated with a wide range of 
child and adolescent behaviors. For example, we 
know that peer rejection is associated with lone- 
liness in elementary school children (Asher & 
Dodge, 1986), that low self-esteem is correlated 
with maladjustment, depression, and anxiety in 
both school and peer relations (Damon, 1983), 
and that adolescent pregnancy is a marker variable 
for a cluster of developmental problems and risk 
factors facing young women (Chase- Lansdale & 
Brooks-Gunn, 1994). 

Although the lack of established diagnostic 
assessments based on this knowledge base pre- 
clude sharing the results of individual perfor- 
mance, investigators working in these areas can 
provide participants or their guardians informa- 
tion about existing services relevant to the devel- 
opmental issues under investigation (Fisher & 
Rosendahl, 1990). For example, an investigator 
studying the development of children’s self-es- 
teem can inform parents about the range of behav- 
iors associated with positive and negative self- 
concepts, and provide a list of community agen- 
cies offering psychosocial evaluations and treat- 
ment of developmental problems. Care should be 
taken, however, that parents understand that the 
investigator is not recommending assessment or 
treatment, but merely providing information about 
such services, should the parent be concerned 
about his or her child's development. 



Identifying such services, especially low- 
cost services, may require the investigator to do 
some “detective” work within the professional 
community. The difficulties encountered in find- 
ing appropriate service systems for children :\nd 
adolescents can help the investigator understand 
and appreciate participants’ service options, 
thereby enriching future research designs focused 
on the interface between developmental risk and 
community supports. 

Working with practitioners. In some situa- 
tions the researcher may be in a position to iden- 
tify specific professionals to assist participants 
with serious problems. For example, a devel- 
opmentalist studying the correlates of adolescent 
pregnancy may establish a relationship with a 
medical facility that can provide prenatal or post- 
natal care to teenage research participants. Refer- 
rals to such services can be made for participants 
who lack medical care. An investigator can also 
collaborate with mental health professionals to 
develop referral procedures for participants in 
need of psychological services, including crisis 
intervention in the case of the participant whose 
responses to the experimental procedure indicate 
a grave situation or life-threatening behaviors 
calling for immediate attention (Fisher, 1993). 
Investigators can enlist practitioners during the 
study's design phase to help develop guidelines 
for identifying participant problems and for han- 
dling consultation with participants as situations 
arise (Fisher & Ro^endahl, 1990). 

Allowing for self-referral. Asher (1993) has 
developed and tested an innovative procedure of 
self-referral devised for peer-rejected and lonely 
children recruited as part of normative samples in 
the schools. With the cooperation of school ad- 
ministrators and staff, Asher gave all child partici- 
pants the opportunity to check a box if they wanted 
to speak to the school social worker about “things 
that bother you,” “questions you might have about 



things you want to change,” or “learning how to 
make friends.” Asher's finding that low-accepted 
children were more likely to self-refer than other 
children strongly supports the proposal that self- 
referral is an ethical procedure that can help high- 
risk children seek assistance. Continued refine- 
ment of this procedure will help identify condi- 
tions under which self-referrals should or should 
not require parental consent (Scarr, in press). 

Providing treatment for control group par- 
ticipants. Intervention research poses additional 
ethical problems in that the standard treatment- 
control group design intentionally denies services 
to children and youth with identified developmen- 
tal risk. There is a growing and persuasive litera- 
ture indicating that basic services, such as nutri- 
tion supplements, medical care, and education, 
can help overcome biological and environmental 
risks (e.g.. Chase-Lansdale & Brooks-Gunn, 1 994; 
Wasik. Ramey, Bryant. & Sparling, 1990). This 
raises ethical concerns about the use of no-treat- 
ment control groups for the evaluation of new 
intervention strategies in the face of empirical 
evidence that such basic services will benefit 
participants (Fisher & Rosendahl, 1990). Provid- 
ing control groups with basic services can jeopar- 
dize conclusions that the treatment per se is sig- 
nificantly better than no treatment at all. But 
comparng psychosocial intervention to less costly 
basic health and social services may be the more 
honest intellectual question, especially in times of 
limited economic resources (Fisher. 1991). 

When no prior treatment strategies have 
been established, a primary ethical consideration 
is whether to provide services for control group 
participants after the experimental intervention 
has been evaluated. At the completion of clinical 
trials research aimed at assessing the efficacy of 
short-term intervention strategies, it is standard 
ethical practice to offer adult control groups the 
treatment found to be most effective. However, at 



the conclusion of long-term studies designed to 
evaluate the effect of an early intervention on later 
development, the treatment may be developmen- 
tally inappropriate for the (now older) controls. 
One way to address this issue is to utilize 
postintervention assessments for referrals to ap- 
propriate community services (Fisher. 1991 ). 

Reporting Illegal and Abusive Behaviors 

Investigators working with high-risk popu- 
lations may. during the course of research, be- 
come privy to information suggesting illegal con- 
duct, use of addictive products, or threats of harm 
or violence to the participant or identified others. 
Researchers need to be aware that certain deci- 
sions — for example, whether to report to authori- 
ties suspected physical or sexual abuse of child 
participants, or to disclose to the court research- 
derived information about illegal participant ac- 
tivities — are not only ethical decisions, but in 
many states have legal implications as well 
(Appelbaum & Rosenbaum, 1989; Fisher. 1991; 
Hoagwood. in press; Liss. in press). 

Child abuse and neglect. In 1976. federal 
law (the Child Abuse Prevention and Treatment 
Act) began to require that states enact statutes to 
mandate the reporting of suspected child abuse or 
neglect. In an extensive review of the 50 different 
state statutes on child abuse and neglect reporting, 
Liss (in press) documented that in at least 13 
states, researchers, as members of the general 
citizenry, are mandated reporters. Other states 
vary as to whether researchers are included in the 
list of mandated reporters, and. if included, whether 
reporting is required if problematic information is 
obtained outside the research protocol. As Liss 
points out, until ease law decisions can provide 
more guidance, researchers must rely on review- 
ing their own state laws to determine their per- 
sonal responsibilities in child abuse and neglect 
reporting. Developmental scientists engaged in 



multidisciplinary collaborations should also de- 
termine whether their colleagues (e.g., pediatri- 
cians, school psychologists) are mandated to re- 
port abuse, because whether they are or not has 
implications for researcher-participant confiden- 
tiality and the protection of information discov- 
ered during the course of research (Fisher, 199 1 ). 
Having determined their own reporting responsi- 
bilities, investigators must also ensure that re- 
search assistants are trained to recognize indica- 
tors of abuse and to follow appropriate proce- 
dures. Finally, the limits of confidentiality, as 
determined by state reporting laws, should be 
fully communicated to both parents and minor 
participants at the time of informed consent. 

Violent behavior. A recent concern of men- 
tal health researchers is whether they share with 
their practitioner colleagues the responsibility to 
prevent the occurrence of violent behavior by a 
research subject (Appelbaum & Rosenbaum, 
1989). This issue is particularly relevant to devel- 
opmental scientists who study family violence or 
delinquency. 

In 1969, Prosenjit Poddar. a student at the 
University of California Berkeley who had been a 
client at the university student health center, mur- 
dered Tatiana Tarasoff. Prior to the murder, the 
psychologist counseling Poddar had notified the 
campus police that his client was dangerous, but 
he withdrew a letter recommending confinement 
after the police concluded Poddar posed no harm. 
Following a series of appeals by the Tarasoff 
family, the Supreme Court of California in Tarasoff 
v. Regents of the University of California ( 1976) 
concluded that the therapist had a duty to warn 
third parties about dangers posed by clients. In 
response to this decision, several states have held 
that a psychotherapist has a duty to protect poten- 
tial victims from a patient's violent acts if the 
therapist ( I ) has a “special relationship" with the 
patient. (2) can predict that violence will occur. 
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and (3) can identify the potential victim. While no 
court case has yet applied this duty-to-protect to 
the research context, given appropriate circum- 
stances (e.g.. the special relationship, prediction 
of violence, and victim identification), some re- 
searchers working with violence-prone popula- 
tions may in the future be expected to assume the 
same duty (Appelbaum & Rosenbaum. 1989). 

The Certificate of Confidentiality. Develop- 
mental scientists may also find themselves in the 
position of collecting information about past abuse, 
violence, or illegal activities. The researcher needs 
to be aware, and must inform his or her research 
participants, that confidential research records are 
subject to subpoena (Melton. 1990a. 1990b: see 
Principle 1 1. SRCD. 1993). Clearly, such legal 
vulnerability creates barriers to participant re- 
sponding and recruiting; potential participants are 
unlikely to agree to participate or answer honestly 
if they believe their responses will result in legal 
proceedings against them. To protect the privacy 
and identity of research participants, investigators 
can apply for a Certificate of Confidentiality is- 
sued by the Department of Health and Human 
Services (Hoagwood. in press). This certificate 
protects the researcher against legally compelled 
disclosure of identifying information about indi- 
vidual research participants. However, the certifi- 
cate does not override all reporting requirements. 
While it can protect the privacy of participant 
reports of sexual activities, substance abuse, ille- 
gal conduct, psychological well-being, and past 
violent or abuse acts, it may not override state and 
local laws governing the reporting of current child 
abuse or communicable disease. From an ethical 
standpoint, researchers who have been granted 
the certificate should inform their participants 
about the extent and limitations of its protection 
and plan furthei ways to protect the confidential- 
ity of their data (Hoagwood. in press). 



Caveats for Sharing, Referring, and Reporting 

This section has addressed the issues re- 
searchers must consider when they obtain infor- 
mation suggesting that participants need assis- 
tance. However, as Scarr (in press) has pointed 
out, researchers must be careful not to go beyond 
their areas of competence or to sacrifice their 
primary responsibility to the research by offering 
referrals, when little is known about how to 
remediate or prevent the developmental problems 
under investigation. 

Developmental scientists must also be wary 
about overreporting or overreferring children and 
adolescents from ethnic minority and low-income 
families (Edelbrock, 1994; Scott-Jones, in press). 
Misunderstandings between researchers and par- 
ticipants who are culturally different, the paucity 
of standardized assessments and appropriate treat- 
ments for ethnic minority children and adoles- 
cents, and the disproportionate extent of abusive 
or violent behaviors that may occur in impover- 
ished communities may lead to referrals to agen- 
cies or reports to authorities that create an unfair 
burden on poor and culturally diverse children 
and their families (Jackson, 1993; S. arr. in press; 
Scott-Jones, in press). 

Generalizing the Results of Research to 

Individuals and Diverse Populations 

A cornerstone of experimental methodology 
involves utilizing large numbers of subjects to 
generate group means that can then provide infor- 
mation about cognitive, socioemotional, behav- 
ioral, or physiological developmental patterns in 
normative and atypical populations. However, as 
“knowledge consumers” (e.g., parents, prac- 
titioners, and policymakers) come to rely increas- 
ingly on the developmental science knowledge 
base, investigators must examine their own re- 
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sponsibility to limit possible misapplications of 
findings derived from group data to policy deci- 
sions that affect the lives of individual children 
and youth (see Principle 14, SRCD. 1993). 

Overestimating the Relevance of Developmental 
Research to Practical Applications 

The question of how much responsibility 
scientists should bear for the potential applica- 
tions of their research has generated heated de- 
bate — both about participant autonomy, i.e., 
whether participants should be informed about the 
implications of the research to social policy, and 
about academic freedom, i.e., how much social 
considerations should influence the pursuit of 
scientific truth (Fisher & Rosendahl, 1990: 
Sarason. 1984: Scarr, 1988). It is increasingly 
clear to action researchers that in order to select 
appropriate experimental variables in studies of 
social problems, it is essential to consider the 
public practices and social circumstances facing 
members of a targe: group ( Baumrind. 1 990: Fisher 
& Rosendahl. 1990). Less clear is the obligation 
investigators have to consider the social implica- 
tions of their research when its relevance to public 
policy and practice is less direct, that is, what 
obligation they may have to clarify the limitations 
of their data or to refrain from proposing action 
beyond what is known (Hoffman, 1990). 

For example, developmental research has 
demonstrated a relationship between infant-mother 
attachment and certain social competencies and 
risks in early childhood (e.g., Bretherton & Wa- 
ters, 1985: Matas, Arend. & Sroufe, 1978). These 
important findings are based on comparisons of 
group means and observations of behavioral pat- 
terns in groups of infants identified as securely or 
insecurely attached, as measured by the Strange 
Situation procedure (Ainsworth. Blehar, Waters. 
&. Wall. 1978). While developmentalists may rec- 
ognize the limitations of their research as it ap- 



plies to an individual child, knowledge consumers 
(e.g.. policymakers, therapists) may not. In fact, 
policymakers have looked in recent years to at- 
tachment research in the infant day-care setting to 
inform policy decisions concerning maternal leave 
and government-sponsored child care programs 
(Belsky, 1986, 1989: Phillips. McCartney, Scarr, 
& Howes, 1987). Moreover, child and family 
therapists are increasingly interested in using the 
Strange Situation procedure as a clinical tool for 
assessing parent-child relationships. 

Since the predictive validity of attachment 
classifications for later cognitive and socio- 
emotional adjustment in individual infants and 
infants from different cultural backgrounds is yet 
to be empirically verified (Bates, Maslin, & 
Frankel, 1985: Erikson, Sroufe, & Egeland. 
1985: Fracasso, Busch-Rossnagel, & Fisher, in 
press; Grossmann, Grossmann, Huber. & Wartner. 
1981: Lamb, 1987; Sagi & Lewkowicz, 1987: 
Thompson, 1988), it would be inappropriate at 
this point to assume that knowledge of a specific 
infant’s attachment classification is a valid indica- 
tor of individual risk or vulnerability. While re- 
searchers cannot be held ethically responsible for 
the less obvious implications others may draw 
from their findings, they do have an ethical re- 
sponsibility, in studies of socially sensitive issues 
( as infant-mother attachment has recently become ). 
to anticipate the potential real world implications 
of their research and to clarify in their writings and 
public presentations the extent, as well as the 
limits, of the application of their findings to prac- 
tice or social policy (Fisher & Rosendahl. 1990: 
Hoffman. 1990; Scarr, 1990). 

Evaluating Social Programs 

Limits of single-strategy interventions. The 
ethics of applying group findings to the lives of 
individual children and youth also come into play 
in empirical evaluations of social programs. So- 



eial programs typically prescribe a single inter- 
vention to promote the development and welfare 
of children or adolescents at risk (Baumrind. 1 990). 
The success of the program in meeting its goals is 
then usually assessed by comparing youth who 
participated in the program with those who did 
not. If analyses indicate that the intervention 
group scored significantly higher on outcome 
measures than the control or comparison group, 
the program is judged a success. If the difference 
between the groups is not statistically significant, 
the program is considered a failure and may face 
being abandoned. 

While aggregating individual responses to a 
program increases the power to detect interven- 
tion effects, it also decreases the opportunity to 
learn who is and who is not benefiting from the 
program. As Lewis points out (Lewis. 1990: Lewis 
& Sullivan, 1994). basing policy decisions solely 
on statistical differences between groups runs 
counter to current empirical information about 
individual differences and variations in human 
development. In fact, based upon our knowledge 
of the contextual factors influencing develop- 
ment. we should assume that individuals who vary 
in their prior experiences, attitudes, and abilities 
would in turn vary in their responses to different 
intervention strategies (Berliner & Cahen. 1978: 
Lcrner & Tubman. 1990). For example, research 
has indicated that adolescents use drugs for differ- 
ent purposes depending on contextual factors. 
High school students most at risk for long-term 
drug problems are school dropouts or adolescents 
from unmotivated families or families modeling 
substance abusive behavior (Pentz, 1994). Ac- 
cordingly. a school-based drug prevention pro- 
gram with an educational/information focus may 
work well for youth who function well academi- 
cally and socially, but it may not meet the needs 
of more troubled and less motivated students. 

To assume that single intervention strategies 
will universally meet the needs of heterogeneous 



populations can have serious repercussions for 
children and their families, especially members of 
disadvantaged or ethnic minority populations 
(Laosa, 1990). First, adherence to single-strategy 
interventions runs the risk of leading to the con- 
clusion that the majority of social programs fail 
(Lewis, 1990). Second, program failure is too 
often attributed to negative attributes of the target 
population rather than to limits of the research 
methodology or the intervention itself. Lastly, a 
negative program evaluation often results in the 
decision to abandon the problem, rather than the 
resolve to find more effective intervention tech- 
niques (Scarr, 1990). Thus, evaluations that ig- 
nore the interaction of participant variability with 
program features can result in families most in 
need being denied services that could promote 
their children's development. 

Multistrategy approaches to program de- 
sign and evaluation. Given the limitations of 
single-intervention approaches, developmental 
scientists must seek to avoid cost-dictated com- 
promises in evaluation research and pursue new 
program design and evaluation methodology to 
address the heterogeneity of child and adolescent 
populations. This can entail three steps: (I) ex- 
ploring how members of the target population 
who benefit from the intervention may differ sys- 
tematically from those who do not: (2) retesting 
the intervention by evaluating its effect on a new 
set of participants with the characteristics of those 
whom step 1 show to benefit from the program: 
and (3) designing new intervention strategies and 
outcome measures to fit the characteristics of 
those who did not benefit from the original pro- 
gram (Lew'is. 1990: Lewis & Sullivan. 1994). 

Such a multi strategy approach does hold the 
risk that certain subgroups of individuals will be 
stigmatized or assigned to services that inadvert- 
ently keep them out of the mainstream (Laosa. 
1990). For example, assigning to drug abuse pre- 
vention programs only those high school students 



who fit a high-risk profile for long-term drug 
usage can lead to negative peer labeling (Pent?.. 
1994). One way to avoid such stigmatizing is to 
structure the intervention to incorporate elective 
and self-referral procedures (Pent?. 1994). 

Abandoning poor intervention strategies, not 
children's problems. Making the evaluation of 
social programs more rigorous does not mean that 
all programs will be evaluated positively. The 
continued use of ineffective intervention strate- 
gies is just as pressing an ethical issue as the 
abandonment of programs that are helping par- 
ticular groups within the population. Growing 
fiscal constraints on federal, state, and commu- 
nity-sponsored social programs call for research- 
ers to evaluate not only the benefits but the human 
and economic costs of social policies. Money 
spent on ineffective programs is money lost to 
effective programs. Therefore, developmentalists 
have an ethical responsibility to provide program 
managers and policymakers with fair evaluations 
of program efficacy. Researchers, and policy- 
makers, should not be afraid to abandon programs 
that do not work (Scarr, 1990). Rather, investiga- 
tors can help design multistrategy approaches that 
will provide policymakers with the information 
they need to attack, not abandon, the problems 
facing today's children and youth. 

Respecting the Cultural Dimensions of 
Children s Lives 

The principle of respect acknowledges that 
research participants are themselves moral agents 
who have the right to judge the purpose and 
procedures of research in which they are asked to 
participate (Fisher & Fyrberg. in press). Threats to 
participant autonomy can stem from two sources: 
( 1 ) research activities that usurp the parental role, 
and (2) unrecognized investigator bias. 

Intervention goals and participant values. 
Most developmental intervention programs oper- 



ate on a two-part assumption: first, that the trajec- 
tory of certain behavioral patterns found within 
certain populations will lead to undesirable out- 
comes if "uncorrected": and second, that if "cor- 
rected," the new patterns will relate to successful 
academic and social adaptation (Sigel, 1990). 
Researchers do not usually question whether the 
program might be at odds with the family values 
of participants, nor whether parents or partici- 
pants consider the "corrected” behaviors or out- 
comes desirable. 

Consider, for example, a social program that 
aims at increasing the academic assertiveness of 
Puerto Rican or Korean immigrant children by 
encouraging the child to make autonomous deci- 
sions and question adult decision making. These 
educational objectives may run counter to the 
parents' beliefs about how a child should behave, 
and this in turn can have negative consequences 
for minor participants. Some evidence suggests 
that behavioral and emotional problems can in- 
deed emerge in late childhood and adolescence as 
a result of the breakdown of the hierarchical struc- 
ture that characterizes many traditional Hispanic 
and Asian families (Fracasso& Busch-Rossnagel, 
1992; McAdoo, 1990: Ryu & Vann. 1992). 

It has been suggested that in using develop- 
mental knowledge to change individuals, research- 
ers should incorporate both the etic (or objective) 
values of the researcher and the emic (or internal) 
values of the child's culture (Jackson, 1 993: Sigel, 
1990), This recommendation extends to program 
evaluations as well: outcome measures that do not 
take the child's family heritage and values into 
account may underestimate children's potential 
and mislead parents, educators, and policymakers. 

Investigator biases. Researchers must also 
appreciate how their own values may be influ- 
enced by social attitudes and, in turn, influence 
policy (Fisher & Tryon, 1990). Throughout the 
relatively short life span of the science of human 
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development, society's changing attitudes toward 
children and families have influenced develop- 
mental theory, interpretations of data, and the 
consequent conceptualization of the child and 
adolescent. The impact of experimenter social 
values on research and theory is evident, for ex- 
ample. in the central role of religious and racial 
categories in the pioneering work of G. Stanley 
Hall (Youniss, 1990) or in the contemporary bi- 
ases affecting inquiry into the developmental cor- 
relates of maternal employment ( Hoffman. 1 990). 

Investigator bias can intrude into every phase 
of the research enterprise. It can effect ( 1 ) con- 
straints on research design (e.g.. requiring major- 
ity control groups for descriptive research on 
ethnic minority children’s development); (2) the 
neglect or pursuit of differences between sub- 
groups (e.g.. gender or ethnic differences); (3) the 
entrenchment of particular paradigms (e.g.. study- 
ing the normative development of majority chil- 
dren and the problematic development of minor- 
ity children); and (4) the choice of analyses (e.g.. 
whether to pursue an ANOVA three-way interac- 
tion). A stereotyping bias can also lead research- 
ers to view ethnic minorities as monolithic, thereby 
denying members of subgroups their individual- 
ity and opportunities for program benefits 
(Hoffman. 1990; Korchin. 1980;McAdoo. 1990). 

Developmentalists must acknowledge their 
own values and respect those of research partici- 
pants. if interventions are to be shaped to serve 
diverse populations. Parents, practitioners, and 
policymakers will benefit from research that is 
cautious with respect to investigator values and 
sensitive to the cultural dimensions of children's 
lives. 

Developmental Psychology and the ( on its 

Evidence from developmental research is 
increasingly finding its way into our legal system. 
Studies relevant to child custody decisions, the 



rights of minors to consent to medical procedures, 
and the ability of children to serve as witnesses are 
just a few examples of the research contribution to 
the knowledge base that is being incorporated by 
judges and jurors. 

Research designed to address legal ques- 
tions. Various moral obligations — derived from 
the principle of beneficence — inevitably emerge 
as developmentalists seek to generate knowledge 
to inform the legal system. The impact of applied 
research on social decisions can be positive or 
negative (Fisher & Tryon. 1990). Data that are 
highly reliable and ecologically valid can provide 
the court with valuable information, but data that 
show inconsistencies or are generated under ex- 
perimental conditions vastly different from the 
real world phenomena of interest can. if utilized 
by the court, jeopardize the legal rights of defen- 
dants or witnesses. Investigators must thus grapple 
with the difficulty of balancing external validity 
(ensuring that experimental conditions correspond 
to the true nature of the phenomena in question) 
and internal validity (controlling extraneous vari- 
ables in order to strengthen conclusions to be 
drawn from findings). 

In efforts to achieve this balance, some in- 
vestigators choose to conduct field studies ot 
children in their real world situations. For ex- 
ample, researchers can assess psychological func- 
tioning and family processes before and after 
custody decisions (Hetherington. Cox. & Cox. 

1 98 1; Shaw & Emery. 1987; Wallerstein& Kelly. 
1980). Others create well-controlled and forensi- 
cally realistic contexts to assess the causal role of 
particular factors in children's behavior. For ex- 
ample. to investigate children's response to lead- 
ing questions regarding child abuse, Tobey and 
Goodman (1992) tested in a laboratory setting the 
accuracy of 4-year-old children's answers to ques- 
tions about their play with a "baby-sitter" after 
they had heard a "police officer" accuse her ot 
wrongdoing. 
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Ecologically valid “staged events” can have 
ethical costs, however, if debriefing does not dis- 
abuse the child of an experimentally induced false 
belief. Attempting to simulate stressful situations 
as they naturally occur, while maintaining inter- 
nal controls at the same time, is often difficult, if 
not impossible. For example, the researcher can- 
not take a child away from one of his or her parents 
to replicate the impact of certain custody deci- 
sions; nor would it be ethical to provide adoles- 
cents with inaccurate health information to test 
their decisions about matiers critical to their well- 
being; nor can we subject children to physical or 
sexual assault to study their reactions to abuse. 
And while observations in the field can generate 
rich descriptive information, they cannot provide 
causal explanations (Goodman & Tobey, 1994). 
As a consequence, the application of develop- 
mental research to legal questions will always be 
limited. 

Given inese limitations, developmentalists 
who conduct empirical studies relevant to the 
court's interests bear certain ethical responsibili- 
ties to assure that ( 1 ) the manipulation or setting 
resembles as closely as possible the phenomena of 
interest; (2) measures are reliable and experimen- 
tal conditions adequately controlled; (3) data are 
presented fairly; and (4) conclusions drawn do not 
go beyond the data. 

The developmental scientist as expert wit- 
ness. The developmentalist who is called upon to 
testify to the court faces several ethical chal- 
lenges. First is the obligation to present research 
findings honestly and fairly. This can be difficult 
on two fronts: ( 1 ) in that findings are rarely un- 
equivocal, an evenhanded testimony can lead to a 
mixed conclusion, with seemingly inconsistent 
data and diverse interpretations — as exemplified 
by current research on the suggestibility of child 
witnesses (Ceci & Bruck, 1993a, 1993b); (2) be- 
cause science and the legal system have different 



purposes (i.e., searching for truth in the case of 
science, and securing justice in the case of law 
[Melton, 1990b; Melton & Limber, 1989|) objec- 
tive evidence can become distorted in the process. 
Accordingly, a researcher’s first ethical responsi- 
bility as an expert witness is to be competent — not 
only in mastery of the developmental literature 
appropriate to the case at hand, but in familiarity 
with legal procedures and the skills and knowl- 
edge underlying forensic work. 

A second challenge facing the develop- 
mentalist as expert witness is to avoid the pitfalls 
of advocating for a particular side of a legal case. 
Since the expert witness is typically retained by an 
attorney on one side of a legal debate, it is tempt- 
ing to view ones duty as providing testimony that 
will help “win” the case. However, such a position 
is inappropriate for the expert whose ethical re- 
sponsibility is rather to be an advocate for the data 
(Melton. 1990b). This means that the expert is 
obliged to provide an accurate summary of scien- 
tific information to assist the finders of facts (the 
judge and jurors) in weighing the evidence put 
before them. 

A third ethical responsibility of the expert 
witness is to avoid generalizing findings derived 
from group means to the individual case. Unlike 
other witnesses, the expert witness is allowed to 
answer “hypothetical" questions. This means that 
the developmental researcher will be asked to 
render an opinion about the facts brought before 
the court. The expert witness must be clear at this 
point about the difference between the probabilis- 
tic nature of science (e.g., that some observed 
difference between groups of individuals did not 
arise by chance but rather at some level of prob- 
able cause) and the all-or-none nature of justice 
(e.g., that an individual is or is not found guilty). 
For example, while one study showed that 84 7r of 
the child participants continued to respond accu- 
rately about the absence of abuse when asked 



misleading questions (Tobey & Goodman. 1 992 ). 
an overgeneralized conclusion based on this find- 
ing could be disastrous for a child abuse defendant 
whose accuser reflects the 1 3 c /c of those children 
who do not respond accurately. Similarly, while 
research has indicated that 509f of 3- and 4-year- 
olds will occasionally distort the truth to achieve 
some aim (Ceci & Leichtman. 1992). an attempt 
to discredit a child's testimony based on this 
finding can be extremely harmful to a youngster 
accurately relaying her abusive experiences in 
court. The developmentalist must clarify for the 
judge and jury the limitations of applying empiri- 
cal findings based on probability and group means 
to the determination of facts about an individual 
(Melton. 1990b). 

Recommendations for Future Policy 

Until recently there has been little dialogue 
or consensus on the complex issues discussed 
here. However, as developmental research in- 
volves more and more high-risk children and 
adolescents, these issues are becoming more sa- 
lient. The recommendations that follow are in- 
tended to provide a framework for reshaping fed- 
eral policy on the ethical conduct of research. 

Specifying the Conditions of Requiring the 
Guardian Consent Waiver 

Federal guidelines on the waiver of guardian 
consent must be more specific. At present, excep- 
tions to the general requirements for informed 
consent put forth by the National Institutes of 
Health Office for Protection of Research Risks 
(OPRR) are inconsistently applied across institu- 
tional review boards, grant review panels, and the 
OPRR itself in reviews of research on specific 
populations (e.g.. runaways, children from abu- 
sive homes, pregnant adolescents). For example, 
according to Title 45. Part 46 of the Code of 



Federal Regulations (DHHS, 1991), informed 
consent may be waived provided the 1RB docu- 
ments that the research investigates factors rel- 
evant to public service programs and is subject to 
government agency approval, or that the research 
could not be practically carried out without waiver. 
While this provision appears to give developmen- 
tal researchers wide latitude in gaining a guardian 
consent waiver, additional guidelines have been 
invoked to preclude a majority of such waivers — 
namely, section 1 16.d.l stating that “the research 
involves no more than minimal risk to the sub- 
jects” (where the probability and magnitude of 
harm is not greater than ordinarily encountered in 
daily life); and section 1 16.d.2 stating that “the 
waiver or alteration will not adversely affect the 
rights and welfare of the subjects” (DHHS, 1991 , 

p. 10). 

Consistency of reviews. In judging the ethi- 
cal acceptability of a research proposal, IRBs, 
grant review panels, and the OPRR often disagree 
in their evaluations of “minimal risk” or “adverse 
effect” (Freedman. Fuks, & Weijer, 1993). While 
the checks and balances provided by the indepen- 
dent evaluation of proposals helps protect partici- 
pants, it is important that standards not be misap- 
plied. Care must be taken to avoid inadvertently 
discouraging studies of high-risk children or lead- 
ing researchers to believe that Title 45, Part 46 of 
the Code of Federal Regulations implies that 
informed consent is not required, with or without 
waiver. Here are some issues that need clarifica- 
tion: 

• What experimental conditions meet 
the criteria of “minimal risk”? At 
present it is not clear whether anony- 
mous surveys or confidential inter- 
views meet federal criteria, nor to 
what extent the child's age or other 
child characteristics should influence 
decisions about minimal ris*k. 



• What types of survey or interview 
content are to be considered poten- 
tially adverse or damaging to minor 
participants? 

• What is the potential negative impact 
of waiving guardian consent in the 
case of descriptive versus interven- 
tion studies? This is an especially 
important question for studies of the 
health-compromising behaviors of 
adolescent populations. 

• How can “risk" be better defined 
across diverse populations? In that 
disadvantaged youth may be subject 
to more everyday risks than their more 
advantaged counterparts, an “every- 
day" criterion may inadvertently lead 
to unjust policies (Kopelman, 1981; 
Scott-Jones. in press). Furthermore, 
using an “advantaged" norm to de- 
fine minimal risk may exclude disad- 
vantaged children from research that 
could benefit them. 

• How is it to be determined that an 
adolescent does not have a legal 
guardian, or has a guardian unable to 
make decisions in the youth’s best 
interest? 

Participant advocates. Federal guidelines 
(DHHS, 1983), recognizing that some children 
and adolescents do not have a “loving" relation- 
ship with a guardian, recommend that under such 
conditions IRBs appoint an advocate to be present 
during the informed consent process. Currently, 
however, the identification and utilization of ad- 
vocates to represent minor participants has been 
limited to institutionalized populations. 

The critical need for empirical data on the 
lives of emancipated adolescents without guard- 



ians calls for federal support for a system of 
participant advocacy. This system would require 
that a researcher who is granted a waiver of guard- 
ian consent appoint an individual(s) with no in- 
vestment in the research project or role in subject 
recruitment to serve as an advocate for the minors 
being recruited for participation. The responsi- 
bilities of the participant advocate would include 
evaluating each youth’s understanding of informed 
consent procedures, ensuring that participation is 
voluntary, checking periodically to determine 
whether the youth wants to terminate participa- 
tion, assessing the participant’s reactions to re- 
search procedures, ensuring that the debriefing 
addresses all participant questions and concerns, 
and serving as a liaison to referral services. 

Federal Guidelines for Reporting and Referring 
Research Participants 

As discussed earlier, research involving chil- 
dren and adolescents from high-risk environments 
has the potential to tap sources of psychopathol- 
ogy, psychological distress, or physical vulner- 
ability that jeopardize participants' welfare. Fed- 
eral guidelines are needed to articulate the 
investigator’s responsibility to share or report 
such information. These guidelines will help re- 
searchers design informed consent procedures and 
establish, where appropriate, referring and report- 
ing procedures that offer reasonable protections 
for minor participants and their families. 

Clarifying the relationship between federal 
and state regulations is also critical to the scien- 
tific community’s ability to design procedures 
that protect not only the welfare but the privacy 
rights of participants and their families. Research- 
ers working with high-risk and disadvantaged 
populations need guidance in the following areas: 

• Under what conditions does the fed- 
eral Certificate of Confidentiality 
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supersede state reporting laws cover- 
ing current and past abusive behav- 
iors? 

• What are the legal risks to research- 
ers who decide to disclose informa- 
tion to protect participant welfare af- 
ter having obtained a Certificate and 
having promised participants confi- 
dentiality? 

• Are researchers under the same obli- 
gation as mental health practitioners 
to report a "suspicion" of child abuse, 
or does a more stringent criterion for 
evaluating child jeopardy apply? 

• What legal protections, if any. are 
available to the researcher whose re- 
port of a "suspicion" of abuse proves 
unfounded? 

Expanded Federal Funding 

In setting child and adolescent welfare as a 
national priority, the federal government runs the 
risk of creating false expectations if funding is 
inadequate to produce ethical practice and empiri- 
cally sound results (Baumrind. 1990). But federal 
agencies can help investigators balance their sci- 
entific and ethical responsibilities by funding a 
variety of efforts: ( I ) innovative ethical practices, 

( 2 ) multistrategy intervention designs that include 
basic services to control groups, (3) research on 
participant reactions to ethical procedures, and (4) 
applied developmental science training programs. 

Innovative practices. Investigators need to 
build into their projects requests for federal sup- 
port for design components aimed at protecting 
the rights and promoting the welfare of research 
participants. Funding is needed to support more 
active efforts to obtain guardian consent within 
the neighborhoods where potential high-risk par- 



ticipants live. This kind of recruitment requires 
funds for pilot work to establish the most effica- 
cious procedures and to train and pay research 
personnel for the demanding, labor-intensive task 
of establishing personal, often door-to-door rela- 
tionships with the community and with potential 
participants and their guardians. 

Funding is also needed to enhance the poten- 
tial benefits of participation in research. For ex- 
ample, while subjects typically derive little or no 
direct benefit from their participation, the debrief- 
ing stage of a study offers an ideal opportunity for 
children and youth to learn how the scientific 
method is applied to the study of human develop- 
ment, or how to avail themselves of mental health 
or social services in the community. Supporting 
budgetary requests for these aspects of research 
can help introduce a new generation to the meth- 
ods of human experimentation and link the idea of 
good science to “doing good." 

Multistrategy design. Federal agencies re- 
sponsible for social service programs should cease 
expecting and demanding that the first interven- 
tion strategy tried be successful, and investigators 
should strengthen their requests for support of 
multistrategy intervention designs. Program evalu- 
ation of different treatment approaches for chil- 
dren and adolescents varying in cognitive skills, 
socioemotional adaptation, and cultural and eco- 
nomic background need to be encouraged. More 
intervention studies requesting and receiving sup- 
port for offering control group participants basic 
services are also needed. 

A "science of scientific ethics. " Vulnerable 
children and youth also stand to benefit from a 
federally sponsored initiative to investigate par- 
ticipant and guardian responses to current ethical 
practices and to the enhanced ethical procedures 
being proposed here (Hoagwood, Jensen, & 
Leshner. in press). Research is needed to inform 
the design of ethical practices that will respect 



participant autonomy, promote participant wel- 
fare. and distribute fairly the costs and benefits of 
scientific inquiry. Following are some of the is- 
sues needing investigation: 

• What determines guardian consent 
and participant assent among chil- 
dren and adolescents who differ in 
actual and perceived risk? 

• What is the impact of monetary in- 
centives on participation? 

• How do participants and families re- 
act to random assignment to control 
groups in intervention studies, and 
what characteristics distinguish those 
who agree and those who refuse to 
participate in random assignment? 

• What type of information do partici- 
pants and their families want to re- 
ceive when a study is over? 

• Under what conditions do partici- 
pants and parents believe researchers 
should report problems identified 
during the course of research? 

• Do members of the focal community 
and research scientists differ in their 
evaluation of the scientific value and 
participant costs of current models of 
developmental research? 

A federal research initiative to generate em- 
pirical data on participant perspectives holds the 
promise not only of building mutual respect be- 
tween investigator and participant, but of helping 
to shape ethical practices (Fisher, in press: F'sher 
& Fyrberg, in press). 

Training programs. Expanded federal fund- 
ing is needed to promote the training of research- 
ers who can work with parents, practitioners, 
social service agencies, and policymakers ( Fisher. 



etal., 1993; Fisher, Rau, & Colapietro, 1993). For 
example, 

• New curricula should promote ethics 
education (to include ethical deci- 
sion making), familiarity with fed- 
eral and professional ethics codes, 
and awareness of state and federal 
laws pertinent to research with high- 
risk children and youth. 

• Students must also acquire the exper- 
tise to (1) evaluate the validity of 
assessment instruments for diverse 
populations, (2) appraise the multi- 
level responses of participants to so- 
cial programs, and (3) evaluate and 
help develop multistrategy interven- 
tions. 

• Federal support should encourage 
curricula that provide students with 
the ?•'; ills to disseminate developmen- 
tal findings clearly and fairly in both 
legal and political forums. 

Conclusion 

As developmental scientists leave the labo- 
ratory to investigate the real world challenges 
facing high-risk populations, the ethical obliga- 
tions to participant welfare become exceedingly 
complex. These new pressures place the investi- 
gator in the “scientist-citizen" dilemma whereby 
competing obligations to produce scientifically 
rigorous knowledge and to protect and promote 
the welfare of research participants must be 
balanced (Fisher & Tryon, 1990: Scarr. in press: 
Veateh. 1987). The current cost-benefit approach 
to ethical decision making often leads investiga- 
tors to look at scientific rigor and social responsi- 
bility as incompatible goals — as either/or possi- 
bilities (Fisher, in press). This report presents a 



framework for integrating these two purposes. 
Researchers and policymakers must work together 
to develop procedures and experimental designs 
that enable investigators to fulfill their obligation 
to a research ethic of scientific responsibility and 
care. 
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